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FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to a method of identifying peptides of a 
desired origin and which are capable of binding to MHC molecules of a 

20 particular haplotype; peptides identified by the method; pharmaceutical 
compositions containing the peptides, databases describing the peptides and 
the use of the peptides in vaccination. 

The following abbreviations are used herein: MHC, Major 
Histocompatibility Complex; 02m, p2-microglobulin; ESI, electrospray 

25 ionization; MS, mass spectrometry; m/z, mass charge ratio; CID, collision 
induced disintegration; MS/MS, tandem mass spectrometry; MTDM, DNA 
methyl transferase; FAS, fatty acid synthase; CTL, cytotoxic T lymphocytes; 
mAbs, monoclonal antibodies. 

The MHC serves as a shuttle to transport and display peptide antigens 

30 on the surface of cells as an indication to the immune system of the health state 
of the cells. Each individual has at most six different MHC class-I haplotypes, 
out of the hundreds known. MHC bound peptides, i.e., peptides bound to, and 
presented in context of, MHC molecules, originate from proteolysis of most of 
the proteins expressed in the cells. Therefore, unique sets of MHC bound 

35 peptides are displayed by each of the different MHC haplotypes according to 



the protein expression and degradation schemes of the cells and according to 
the peptide binding motifs of the MHC molecules (reviewed in [1]). 
Therefore, thousands of different peptides are presented by the different MHC 
class-I haplotypes and each of the peptides is presented in vastly differing 
5 copies per cell [2]. When cells become infected, some of the presented 
peptides are derived from the pathogen's proteins, and so indicate to 
circulating T-cells to kill the diseased cells and prevent the spread of the 
disease. 

Each MHC haplotype recognizes the peptides through a broadly defined 

10 consensus motif of peptide's amino acids strategically positioned to serve as 
anchors to the appropriate binding pockets on the MHC molecule. The 
binding motifs of many of the MHCs haplotypes were first established by pool 
Edman sequencing of unfractionated peptide mixtures eluted from 
immunoaffinity purified MHC molecules [3, 4], The consensus was further 

15 extended by direct biochemical analysis of individual peptides separated by 
reversed phase chromatography and analyzed by tandem mass spectrometry [2, 
5, 6], reviewed in [7]. 

MHC bound peptides derived from cancer specific or associated 
proteins or antigens were extensively searched for, with the goal of finding 

20 among them peptide candidates for development of anti-cancer vaccines. A 
number of such tumor specific peptides were already identified and some were 
successfully tested as anti-cancer vaccines for human treatment, most notably 
for immunotherapy of melanoma [8, 9]. Three main approaches were 
extensively used for the identification of such MHC bound peptides [10]. The 

25 genetic approach involves transfection of cDNA libraries, made from tumor 
cells, into cells that present the MHC allele of interest. The clones of 
transfected cells that stimulated CTL lines against the tumor cells were 
selected as the source for the tumor antigen and the genes were further 
fragmented to isolate the regions of the genes that encode the particular 

30 immunogenic peptide [11]. The second approach is based on exploiting the 



known consensus binding motifs of the MHC haplotype of interest to scan 
sequences of known protein "m silico" and to predict putative MHC bound 
peptides that fit this consensus [12]. For successful prediction, these 
consensus motifs should be a priori well established, which is not the case for 
many of the MHC haplotypes [13]. The drawback of this approach is its 
reliance on chemical synthesis of a large number of peptides, only few of 
which end up being useful. The biochemical, third approach, involves the 
fractionation of the MHC bound peptides by chromatography, assaying the 
fractions for immuonoglogical activity and sequencing the individual peptides 
in the active fractions [2, 5]. The biochemical approach is the only possible 
way to identify post-translationally modified peptides, not always predictable 
from the protein sequences [14-16]. The biochemical approach depends on the 
availability of advanced mass spectrometry, needed for analyzing the available 
minute amounts of peptides that are present at very complex mixtures 
(reviewed in [7]). 

All these approaches for identifying MHC bound peptides eventually 
rely on chemical synthesis of the peptides of interest to test their capacity to 
bind to the MHC molecule by stabilization of empty MHC molecules on cell 
surface [17], and their potential to elicit an immune response by tetramer 
assays [18], ELISPOT [19] and elicitation CTL responses when presented on 
cells [20]. 

Currently, sequencing and identification of individual MHC bound 
peptides by the direct biochemical approach is most effectively performed by 
use of tandem mass spectrometry. The peptides are resolved by reversed phase 
chromatography and the eluting peptides are collected, assayed for biological 
activity and sequenced, most often by electrospray tandem mass spectrometry 
[2, 5, 21]. Comparing the patterns of MHC bound peptides recovered from 
healthy and infected cells helps to identify disease related peptides [22]. Mass 
spectrometry is advantageous for such analysis due to its accuracy, speed of 
analysis, its ability to analyze complex mixtures of peptides and its high 



sensitivity [7]. The biochemical analysis involves the purification of the MHC 
molecules with their bound peptides by immunoaffinity chromatography using 
mAbs specific for the native MHC [2]. To this end, the cells are solubilized 
with detergents, the desired MHC molecules are purified with their MHC 

5 bound peptides and the MHC bound peptides are recovered by denaturation 
and ultra-filtration. However, once the cells are disrupted by the detergents, 
the MHC molecules become contaminated by cellular debris and detergents 
which complicates the subsequent ESI-MS/MS analysis. Moreover, such 
immunoaffinity purification of desired MHC haplotypes is possible only when 

io specific mAbs are available, whereas for many MHC haplotypes such mAbs 
are presently unavailable. 

There is thus a widely recognized need for, and it would be highly 
advantageous to have, a method for identifying MHC bound peptides devoid 
of the above limitations. 

15 

SUMMARY OF THE INVENTION 

While conceiving the present invention, it was hypothesized that MHC 
bound peptides presented within the context of different MHC haplotypes on 
cells of different tissues or tumor origins can be biochemically identified by 

20 transforming the cells to express and secrete soluble MHC molecules of the 
different MHC haplotypes, with the aim of biochemically identifying the MHC 
bound peptides that bind to the soluble MHC molecules. Should this approach 
be successful, it solves three major problems associated with the prior art 
biochemical approach. First, although not excluded, there is no need for 

25 specific mAbs per each tipe of MHC, rather general mAbs such as W6/32 (anti 
HLA-A, B and C) can be used to isolate the sMHC and hence the MHC bound 
peptides from the growth medium in which the cells are grown. Second, while 
the prior art approach relies on native MHC molecules, different MHC 
haplotypes directing the expression of different soluble MHC molecules can 

30 potentially be used for each of the cells, to thereby increase the repertoire of 



MHC bound peptides which can be used as, for example, anti-cancer vaccines. 
Third, since the cells are not disrupted and further since there is no use of 
detergents, the sMHC molecules do not become contaminated by cellular 
debris and detergents which otherwise complicates the subsequent 
ESI-MS/MS analysis. 

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype, the method 
comprising obtaining a cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; and analyzing 
peptides bound to the soluble and secreted form of the MHC molecules of the 
particular haplotype, thereby identifying the peptides originating from the 
particular cell type and being capable of binding to MHC molecules of the 
particular haplotype. 

According to further features in preferred embodiments of the invention 
described below, the cell type is a cancer cell. 

According to still further features in the described preferred 
embodiments the cell type is a cancer cell line. 

According to still further features in the described preferred 
embodiments the cell type is a virus infected cell or cell line. 

According to still further features in the described preferred 
embodiments the cell type is a cell involved in a development and/or 
progression of an autoimmune diseases. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from at least one protein of interest 
and being capable of binding to MHC molecules of a particular haplotype, the 
method comprising obtaining cells co-expressing the at least one protein of 
interest and a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
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molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the at least one protein of interest and 
being capable of binding to MHC molecules of the particular haplotype. 
5 According to further features in preferred embodiments of the invention 

described below, the protein of interest in natively expressed by the cells. 

According to still further features in the described preferred 
embodiments the at least one protein of interest in expressed by the cells 
following transformation of the cells with nucleic acid encoding for the at least 
l o one protein of interest. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a tumor associated 
antigen. 

According to still further features in the described preferred 
15 embodiments the at least one protein of interest includes a cytokine. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a protein of a 
pathogen. 

According to still further features in the described preferred 
20 embodiments the soluble and secreted form of the MHC molecules include a 
polypeptide encoded by exons 5 to 8 of a murine mutant Q10 b . 

According to still further features in the described preferred 
embodiments analyzing the peptides bound to the soluble and secreted form of 
the MHC molecules of the particular haplotype is by mass spectrometry, mass 
25 charge ratio and collision induced disintegration. 

According to still further features in the described preferred 
embodiments identifying peptides originating from the at least one protein of 
interest and being capable of binding to MHC molecules of the particular 
haplotype is by comparison to a protein database. 



According to another aspect of the present invention, there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the methods described herein. 

According to still another aspect of the present invention, there is 
5 provided a kit comprising a plurality of individual containers, each of the 
plurality of individual containers containing at least one peptide identified by 
the methods described herein. 

According to yet another aspect of the present invention there is 
provided a method of identifying peptides originating from cancer associated 

10 proteins and being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining a cancer cell type expressing a 
soluble and secreted form of the MHC molecules of the particular haplotype; 
collecting the soluble and secreted form of the MHC molecules of the 
particular haplotype; analyzing peptides bound to the soluble and secreted 

15 form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from cancer associated proteins and being capable of 
binding to MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from cells participating 

20 in the development and/or progression of an autoimmune disease and being 
capable of binding to MHC molecules of a particular haplotype, the method 
comprising obtaining cells participating in the development and/or progression 
of the autoimmune disease and expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 

25 form of the MHC molecules of the particular haplotype; analyzing peptides 
bound to the soluble and secreted form of the MHC molecules of the particular 
haplotype; and identifying peptides originating from proteins participating in 
the development and/or progression of the autoimmune disease and being 
capable of binding to MHC molecules of the particular haplotype. 
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According to an additional aspect of the present invention there is 
provided a method of identifying peptides originating from virus infected cells 
and being capable of binding to MHC molecules of a particular haplotype, the 
method comprising obtaining virus infected cells expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype collecting the 
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the virus and being capable of binding to MHC molecules of the 
particular haplotype. 

According to yet an additional aspect of the present invention there is 
provided a method of identifying peptides originating from a particular cell 
type characterized by at least one of the following (i) cell over-expressing at 
least one protein; (ii) cells characterized by induced mutations; (iii) cells of 
metastases; (iv) normal or transformed cells expressing cell surface proteins, 
the peptides being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining cells of the particular cell type 
expressing a soluble and secreted form of the MHC molecules of the particular 
haplotype; collecting the soluble and secreted form of the MHC molecules of 
the particular haplotype; analyzing peptides bound to the soluble and secreted 
form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from the particular cell type and being capable of binding 
to MHC molecules of the particular haplotype. 

According to still an additional aspect of the present invention there is 
provided an electronic data storage device, storing, in a retrievable form, a 
plurality of peptides being arranged at least according to their association with 
a pathology and further according to their ability of binding to MHC molecules 
of a particular haplotype. 

According to a further aspect of the present invention there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 



peptides being arranged at least according to their association with a protein of 
interest and further according to their ability of binding to MHC molecules of 
a particular haplotype. 

According to yet a further aspect of the present invention there is 
5 provided a method of eliciting an immune response against a protein of interest 
in a subject having a particular MHC haplotype, the method comprising 
determining the subject's particular MHC haplotype; and administering to the 
subject an effective amount of at least one peptide derived from the protein of 
interest and which is capable of binding to MHC molecules of the particular 
10 haplotype. 

According to still a further aspect of the present invention there is 
provided a method of treating a pathology by eliciting an immune response 
against a protein of interest in a subject having a particular MHC haplotype, 
the method comprising determining the subject's particular MHC haplotype; 

15 and administering to the subject a therapeutic effective amount of at least one 
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

According to an additional aspect of the present invention, there is 
provided a method of eliciting an immune response against a protein of interest 

20 in a subject, the method comprising using an individualized in vitro assay for 
determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
of interest and which is capable of inducing predetermined sufficient immune 

25 reactivity. 

According to further features in preferred embodiments of the invention 
described below, administering to the subject the therapeutically effective 
amount of the at least one peptide is accompanied by presenting the at least 
one peptide in context of an antigen presenting cell. 
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According to still an additional aspect of the present invention, there is 

provided a peptide selected from the group consisting of SEQ ID NOs:4-6, 

10-14, 19-21, 23-37, 44-88, 90-141, 143-144, 146-173, 175-189 and 191-195, 

all of which were never reported to bind MHC molecules. 

5 According to still an additional aspect of the present invention, there is 

provided a peptide selected from the group consisting of SEQ ID NOs: 5, 9, 10 

and 25. 

According to yet an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs: 13, 20, 
10 23 and 24. 

According to another aspect of the present invention, there is provided a 
pharmaceutical composition comprising, as an active ingredient, at least one of 
the peptides described herein, and a pharmaceutically acceptable carrier. 
Preferably, the at least one of the peptides is presented in context of an antigen 
15 presenting cell. 

According to further features in preferred embodiments of the invention 
described below, the peptide comprises at least one modification rendering 
peptides more stable in a body and/or more immunogenic. 

According to still further features in the described preferred 
20 embodiments the at least one modification is selected from the group 
consisting of peptoid modification, semipeptoid modification, cyclic peptide 
modification, N terminus modification, C terminus modification, peptide bond 
modification, backbone modification and residue modification. 

The present invention successfully addresses the shortcomings of the 
25 presently known configurations by providing a novel method for the 
identification of MHC bound peptides. 



BRIEF DESCRIPTION OF THE DRAWINGS 
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The invention is herein described, by way of example only, with 
reference to the accompanying drawings. With specific reference now to the 
drawings in detail, it is stressed that the particulars shown are by way of 
example and for purposes of illustrative discussion of the preferred 
5 embodiments of the present invention only, and are presented in the cause of 
providing what is believed to be the most useful and readily understood 
description of the principles and conceptual aspects of the invention. In this 
regard, no attempt is made to show structural details of the invention in more 
detail than is necessary for a fundamental understanding of the invention, the 
10 description taken with the drawings making apparent to those skilled in the art 
how the several forms of the invention may be embodied in practice. 

In the drawings: 

FIGs. 1A-C demonstrate the purification of soluble MHC from cancer 
cells. Soluble MHCs was purified by immunoaffinity from the growth 

15 medium of 5xl0 7 transfected cells. Purified proteins were analyzed on 10% 
SDS-PAGE and stained with Coomassie blue. (1A) Purification of 
HLA-A2/Q10b from MCF-7 cells, (IB) Purification of sHLA-A2 from 
UCI-101 cells, (1C) Purification of sHLA-B7 from UCI-107 cells. 

FIGs. 2A-C demonstrate the purification process of soluble HLA-Cw4 

20 from human ovrian cancer cells. (2A) Chromatorgaphy purification of 
sHLA-Cw4 from the growth medium of UCI-101 transfected cells. Column S, 
represents first purification step on sepharose beads. Column W6, represents 
second purification step by affinity chromatrography on W6/32 antibody 
column; (2B) SDS-PAGE analysis and Coomassie staining of eluted fractions 

25 (el-e5); (2C) Western blot analysis of eluted fractions. 

FIGs. 3A-C show a typical nano-capillary reversed-phase 
chromatography of MHC bound peptides purified from soluble MHC from 
5X10 7 MCF-7 breast cancer cells. (3A) The total-ion-current chromatogram 
(TIC). (3B) Mass spectrum taken at the time point of 33.3 minutes. (3C) 

30 Spectrum of the collision-induced-disintegration (CID) of the dominant 
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peptide in 3B having a m/z of 1028.5 that eluted at the 33.3 minutes. The 
putative MHC peptide GLIEKNIEL (SEQ ID NO: 13) was identified to 
originate from DNA-methyl transferase. 

FIGs. 4A-B show a comparison of the chromatographs, the MS and the 
CID spectra of the synthetic peptide: GLIEKNIEL (SEQ ID NO: 13) of the 
DNA methyl transferase (4A) with those of the peptide m/z=1028.5 (SEQ ID 
NO: 13) from the breast cancer line MCF-7 (4B). 

FIGs. 5A-D demonstrate the evaluation of the correctness of the 
MAGE-B2 peptide pi 091 (GVYDGEEHSV, SEQ ID NO:20) by comparing 
the retention time and CID spectra of the synthetic peptide (5 A) to that of the 
natural peptide m/z=1091.4 (SEQ ID NO:20) from the ovarian cancer line 
UCI-107 (5B). (5C) Evaluation of the binding affinity of peptide pi 091 (SEQ 
ID NO:20) to HLA-A2 by reconstituting it into cells surface empty MHC of 
the RMA-S-HHD cells as assayed by FACS analysis. (5D) The homology 
between this MAGE-B2 peptide, pi 091 (SEQ ID NO:20) to two other already 
known HLA-A2 peptides derived from homologous region in MAGE-A4 
GVYDGREHTV (SEQ ID NO:38) [27] and MAGE-A10 proteins 
GLYDGMEHL (SEQ ID NO:39) [28]. 

FIG. 6 shows an example of reconstitution of peptides into cells surface 
MHC to test their binding and affinity as assayed by FACS analysis. Synthetic 
peptides were added to 10 6 RMA-S-HHD cells to a concentration of 100 jJVI 
followed by incubation for two hours at 26°C and two hours at 37°C. The 
stability of the peptides binding to the HHD cells was measured by indirect 
FACS assay after decoration for another hour with the W6/32 mAb at 4°C and 
30 minutes incubation with FITC goat anti-mouse Ab at 4°C. The HLA-A2.1 
peptide derived from gplOO served as a positive control and unloaded 
RMA-S-HHD cells as a negative control. 

FIG. 7 demonstrates a CTL assay with murine cells presenting human 
MHC (EL4-HHD). Cells were loaded separately with individual peptides, 
washed and injected in four groups: 1- pl028 (SEQ ID NO:13) alone, 2- 
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pl258 (SEQ ID NO:24) alone, 3- pool of peptides: p913 (SEQ ID NO:5), p958 
(SEQ ID NO:9), p989 of CD59 (SEQ ID NO:ll) and p989 of FLI (SEQ ID 
NO: 12), 4- peptides pl031 (SEQ ID NO:14), pll21 (SEQ ID NO:22) and 
pl068 (SEQ ID NO:16). Unloaded EL4-HHD or targets cells not loaded with 
the peptides were used as negative controls. An effector-to-target ratio of 50:1 
is shown. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of a method of identifying peptides of a desired 
origin, such as tumor associated antigens, pathogen (e.g., virus, bacteria) 
derived antigens, endogenous cytokines, etc., which are capable of binding to 
MHC molecules of a particular haplotype. The present invention is further of 
peptides identified by the method and pharmaceutical compositions containing 
the peptides. Still, the present invention is further of databases describing the 
peptides and the use of the peptides in vaccination to treat and/or prevent 
various pathologies, cancer and autoimmune diseases, in particular. 

The principles and operation of the present invention may be better 
understood with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it 
is to be understood that the invention is not limited in its application to the 
details set forth in the following description or exemplified by the Examples. 
The invention is capable of other embodiments or of being practiced or carried 
out in various ways. Also, it is to be understood that the phraseology and 
terminology employed herein is for the purpose of description and should not 
be regarded as limiting. 

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining a cell 
type expressing a soluble and secreted form of the MHC molecules of the 
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particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; and analyzing peptides bound to the 
soluble and secreted form of the MHC molecules of the particular haplotype, 
thereby identifying the peptides originating from the particular cell type and 
being capable of binding to MHC molecules of the particular haplotype. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from cancer associated proteins and 
being capable of binding to MHC molecules of a particular haplotype. The 
method according to this aspect of the present invention is effected by 
obtaining a cancer cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; analyzing peptides 
bound to the soluble and secreted form of the MHC molecules of the particular 
haplotype; and identifying peptides originating from cancer associated proteins 
and being capable of binding to MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from cells participating 
in the development and/or progression of an autoimmune disease and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining cells 
participating in the development and/or progression of the autoimmune disease 
and expressing a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from proteins participating in the development 
and/or progression of the autoimmune disease and being capable of binding to 
MHC molecules of the particular haplotype. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from virus infected cells and being 
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capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining virus 
infected cells expressing a soluble and secreted form of the MHC molecules of 
the particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the virus and being capable of binding to 
MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from a particular cell 
type characterized by at least one of the following (i) cell over-expressing at 
least one protein; (ii) cells characterized by induced mutations; (iii) cells of 
metastases; (iv) normal or transformed cells expressing cell surface proteins, 
the peptides being capable of binding to MHC molecules of a particular 
haplotype. The method according to this aspect of the present invention is 
effected by obtaining cells of the particular cell type expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype; collecting the 
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the particular cell type and being capable of binding to MHC molecules 
of the particular haplotype. 

In general, the present invention provides a method of identifying 
peptides originating from at least one protein of interest or an unknown protein 
and being capable of binding to MHC molecules of a particular haplotype. 
The method is effected by obtaining cells co-expressing the at least one protein 
of interest or unknown protein and a soluble and secreted form of the MHC 
molecules of the particular haplotype; collecting the soluble and secreted form 
of the MHC molecules of the particular haplotype; analyzing peptides bound to 
the soluble and secreted form of the MHC molecules of the particular 
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haplotype; and identifying peptides originating from the at least one protein of 
interest or unknown protein and being capable of binding to MHC molecules 
of the particular haplotype. Depending to a great extent on the cell type 
employed, it will Once a peptide of an unknown protein is identified, this 
5 protein becomes a protein of interest. 

The protein of interest or unknown protein can be a native protein 
expressed by the cells, or the protein of interest can be expressed by the cells 
following transformation of the cells with nucleic acid encoding for the protein 
of interest using techniques well known in the art. 

10 The method of the present invention can thus be used to associate 

proteins of yet unknown pattern of expression with particular tissues or cell 
types, such as cancer cells. In addition, the method of the present invention 
can be used to determine whether a specific open reading frame (ORF) is 
expressed or not in certain cells. 

15 In one preferred embodiment of the present invention the cell type is a 

cancer cell or a cancer cell line. Primary cell lines, metastatic cell lines, tumor 
cell lines and normal cell lines which are suitable for implementing the method 
of the present invention are available, for example, from ATCC. Tables 1 and 
2 below provide examples: 

20 

TABLE 1 
Primary and metastatic cell lines 



Primary Cell Line 


Metastatic Cell Line 


ATCC 
No. 


Name 


Disease 


Tissue 


ATCC 
No. 


Name 


Tissue 


CCL-228 


SW480 


colorectal 
adenocarcinoma 


colon 


CCL-227 


SW620 


lymph 
node 


CRL-1864 


RF-1 


gastric adenocarcinoma 


stomach 


CRL-1863 


RF-48 


ascites 


CRL-1675 


WM-115 


melanoma 


skin 


CRL-1676 


WM-266-4 


n/a 


CRL-7425 


Hs 688(A).T 


melanoma 


skin 


CRL-7426 


Hs 688(B).T 


lymph 
node 



25 



TABLE 2 



17 

Tumor and normal cell lines 



Tumor Cell Line 


Normal Cell Line 


ATCC 
No. 


Name 


Cancer 
Type 


Tissue 
Source 


ATCC 
No. 


Name 


Tissue 
Source 


CCL-256 


NCI-H2126 


carcinoma; non-small 
cell lung cancer 


lung 


CCL-256. 1 


NCI-BL2126 


peripheral 
blood 


CRL-5868 


NCI-H1395 


adenocarcinoma 


lung 


CRL-5957 


NCI-BL1395 


peripheral 
blood 


CRL-5872 


NCI-H1437 


adenocarcinoma 


lung 


CRL-5958 


NCI-BL1437 


peripheral 
blood 


CRL-5882 


NCI-H1648 


adenocarcinoma 


lung 


CRL-5954 


NCI-BL1648 


peripheral 
blood 


CRL-591 1 


NCI-H2009 


adenocarcinoma 


lung 


CRL-5961 


NCI-BL2009 


peripheral 
blood 


CRL-5985 


NCI-H2122 


adenocarcinoma 


pleural effusion 


CRL-5967 


NCI-BL2122 


peripheral 
blood 


CRL-5922 


NCI-H2087 


adenocarcinoma 


lymph node 
(metastasis) 


CRL-5965 


NCI-BL2087 


peripheral 
blood 


CRL-5886 


NCI-H1672 


carcinoma; classic 
small cell lung cancer 


lung 


CRL-5959 


NCI-BL1672 


peripheral 
blood 


CRL-5929 


NCI-H2171 


carcinoma; small cell 
lung cancer 


lung 


CRL-5969 


NCI-BL2171 


peripheral 
blood 


CRL-5931 


NCI-H2195 


carcinoma; small cell 
lung cancer 


lung 


CRL-5956 


NCI-BL2195 


peripheral 
blood 


CRL-5858 


NCI-H1184 


carcinoma; small cell 
lung cancer 


lymph node 
(metastasis) 


CRL-5949 


NCI-BL1184 


peripheral 
blood 


HTB-172 


NCI-H209 


carcinoma; small cell 
lung cancer 


bone marrow 
(metastasis) 


CRL-5948 


NCI-BL209 


peripheral 
blood 


CRL-5983 


NCI-H2107 


carcinoma; small cell 
lung cancer 


bone marrow 
(metastasis) 


CRL-5966 


NCI-BL2107 


peripheral 
blood 


HTB-120 


NCI-H128 


carcinoma; small cell 
lung cancer 


pleural effusion 


CRL-5947 


NCI-BL128 


peripheral 
blood 


CRL-591 5 


NCI-H2052 


mesothelioma 


pleural effusion 


CRL-5963 


NCI-BL2052 


peripheral 
blood 


CRL-5893 


NCI-H1770 


neuroendocrine 
carcinoma 


lymph node 
(metastasis) 


CRL-5960 


NCI-BL1770 


peripheral 
blood 


HTB-126 


Hs 578T 


ductal carcinoma 


mammary 
gland; breast 


HTB-125 


Hs 578Bst 


mammary 

gland; 

breast 


CRL-2320 


HCC1008 


ductal carcinoma 


mammary 
gland; breast 


CRL-2319 


HCC1007 
BL 


peripheral 
blood 


CRL-2338 


HCC1954 


ductal carcinoma 


mammary 
gland; breast 


CRL-2339 


HCC1954 
BL 


peripheral 
blood 


CRL-2314 


HCC38 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2346 


HCC38 BL 


peripheral 
blood 


CRL-2321 i 


HCC1143 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2362 


HCC1143 
BL 


peripheral 
blood 


CRL-2322 


HCC1187 


primary ductal 


mammary 


CRL-2323 


HCC1187 


peripheral 
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carcinoma 


gland; breast 




BL 


blood 


CRL-2324 


HCC1395 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2325 


HCC1395 
BL 


peripheral 
blood 


CRL-2331 


HCC1599 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2332 


HCC1599 
BL 


peripheral 
blood 


CRL-2336 


HCC1937 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2337 


HCC1937 
BL 


peripheral 
blood 


CRL-2340 


HCC2157 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2341 


HCC2157 
BL 


peripheral 
blood 


CRL-2343 


HCC2218 


primary ductal 
carcinoma 


mammary 
gland; breast 


CRL-2363 


HCC2218 
BL 


peripheral 
blood 


CRL-7345 


Hs 574.T 


ductal carcinoma 


mammary 
gland; breast 


CRL-7346 


Hs 574.Sk 


skin 


CRL-7482 


Hs 742.T 


scirrhous 
adenocarcinoma 


mammary 
gland; breast 


CRL-7481 


Hs 742.Sk 


skin 


CRL-7303 


Hs 496.T 


cancer 


mammary 
gland; breast 


CRL-7302 


Hs 496.Sk 


skin 


CRL-7486 


Hs 748.T 


cancer 


mammary 
gland; breast 


CRL-7485 


Hs 748.Sk 


skin 


CRL-7365 


Hs 605.T 


carcinoma 


mammary 
gland; breast 


CRL-7364 


Hs 605.Sk 


skin 


CRL-7368 


Hs 606 


carcinoma 


mammary 
gland; breast 


CRL-7367 


Hs 606.Sk 


skin 


CRL-1974 


COLO 829 


malignant melanoma 


skin 


CRL-1980 


COLO 
829BL 


peripheral 
blood 


CRL-7762 


TE 354T 


basal cell carcinoma 


skin 


CRL-7761 


TE 353.Sk 


skin 


CRL-7690 


Hs 939T 


malignant melanoma 


skin 


CRL-7689 


Hs 939.Sk 


skin 


CRL-7360 


Hs 600.T 


melanoma 


skin 


CRL-7359 


Hs 600.Sk 


skin 


CRL-7677 


Hs 925.T 


pagetoid sarcoma 


skin 


CRL-7676 


Hs 925.Sk 


skin 


CRL-7672 


Hs919.T 


benign osteoid 
osteoma 


bone 


CRL-7671 


Hs919.Sk 


skin 


CRL-7554 


Hs 821. T 


giant cell sarcoma 


bone 


CRL-7553 


Hs 821. Sk 


skin 


CRL-7552 


Hs 820.T 


heterophilic 
osteofication 


bone 


CRL-7551 


Hs 820.Sk 


skin 


CRL-7444 


Hs 704.T 


osteosarcoma 


bone 


CRL-7443 


Hs 704.Sk 


skin 


CRL-7448 


Hs 707(A).T 


osteosarcoma 


bone 


CRL-7449 


Hs 

707(B).Ep 


skin 


CRL-7471 


Hs 735.T 


osteosarcoma 


bone 


CRL-7865 


Hs 735.Sk 


skin 


CRL-7571 


Hs 836.T 


osteosarcoma 


bone 


CRL-7570 


Hs 836.Sk 


skin 


CRL-7595 


Hs 860.T 


osteosarcoma 


bone 


CRL-7519 


Hs 791 .Sk 


skin 


CRL-7622 


Hs 888.T 


osteosarcoma 


bone 


CCL-211 


Hs888Lu 


lung 


CRL-7626 


Hs 889.T 


osteosarcoma 


bone 


CRL-7625 


Hs 889.Sk 


skin 


CRL-7628 


Hs 890.T 


osteosarcoma 


bone 


CRL-7627 


Hs 890.Sk 


skin 


CRL-7453 


Hs 709.T 


periostitis; granuloma 


bone 


CRL-7452 


Hs 709.Sk 


skin 


CRL-7432 


Hs 696T 


adenocarcinoma 


unknown 


CRL-7431 


Hs 696.Sk 


skin 


CRL-7886 


Hs 789.T 


transitional cell 
carcinoma 


ureter 


CRL-7518 


Hs 789.Sk 


skin 


CRL-7547 


Hs814T 


giant cell sarcoma 


vertebral 
column 


CRL-7546 


Hs 814.Sk 


skin 
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In another preferred embodiment of the invention, the cell type is a 
virus infected cell or cell line. Table 3 below provides examples of some 
known viruses and the diseases they cause: 



TABLE 3 



Diseases 


Viruses and other pathogens 


African sleeping sickness (African 
trypanosomiasis) 


Trypanosoma brucei 


AIDS 


HIV 


Amebiasis 


Entamoeba histolytica 


BSE ("mad cow disease") and nvCJD 




Campylobacter infections 


Campylobacter 


Chagas' disease (American trypanosomiasis) 


Trypanosoma cruzi 


Cholera 


Vibrio cholerae 


Coccidioidomycosis 


Coccidioides immitis 


Cryptosporidiosis 


Crypto sporidium 


Cyclosporiasis 


Cyclospora 


Dengue fever 


Dengue viruses 


Diphtheria, tetanus, and pertussis 


Toxin-producing strains of Corynebacterium 
diphtheriae 


Bordetella pertussis 




Encephalitis 


Japanese encephalitis virus Tickborne 
encephalitis West Nile virus 


Filariasis 


Wuchereria bancrofti and Brugia malayi 


Giardiasis (Giardia infection) 


Giardia intestinalis 


Hantavirus pulmonary syndrome 


Hantavirus 


Hepatitis 


Hepatitis viruses A, B, C, E 


Histoplasmosis 


Histoplasma capsulatum 


Influenza (flu) 




Leishmaniasis 


Leishmania 


Leptospirosis 


Leptospira 


Lyme disease 


B. burgdorferi sensu stricto, B. afzelii, or B. 
garinii 


Malaria 


Plasmodium falciparum P. vivax P. ovale and P. 
malariae 


Measles, mumps, and rubella (MMR) 
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oneumoniae and Neisseria meningitidis 


Onchocerciasis (river blindness) 


Onchocerca volvulus 


Plague 


Yersinia nestis i 


Poliomyelitis 




Rabies 


Rhabdoviridae, genus Lyssavirus 


Rocky Mountain spotted fever 




Rickettsia 


rickettsii 


severe diarrhea 


Rotavirus 


Salmonellosis 


Salmonella 


Schistosomiasis 




Shigellosis 


Shigella 


Tuberculosis (TB) 


Mvcohacterium niHercnlnsis 


Typhoid fever 


Salmonella serogroup Typhi 


Tvohus fevers 




chickenpox 


Varicella 


Vibrio parahaemolyticus 




Viral hemorrhagic fevers (e.g., Ebola, Lassa, 
Marburg, Rift Valley). 


arenaviruses, filoviruses, bunyaviruses, and 
flaviviruses 


Yellow fever 





In yet another preferred embodiment of the present invention, the cell 
type is a cell involved in a development and/or progression of an autoimmune 
diseases such as T or B cells, and/or an allergic disease or condition, such as 
5 mast cells. 

In one example, the at least one protein of interest is a tumor associated 
antigen. The tumor associated antigen can be natively expressed by the cells 
or can be expressed by appropriately transformed cells. Table 4 below lists 
some known genes encoding proteins which were identified as tumor 
io associated antigens. 



TABLE 4 



Gene Symbol 


Gene Name 


Locus 


Disorders 
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ABLl 


v-abl Abelson murine leukemia viral 
oncogene homo log 1 


9q34.1 


Leukemia, chronic myeloid 


ABL2 


v-abl Abelson murine leukemia viral 
oncogene homolog 2 (arg, Abelson-related 
gene) 


Iq24-q25 


Leukemia, acute myeloid, with eosinophilia 


AKT2 


v-akt murine thymoma viral oncogene 
homolog 2 


19ql3.1-ql3.2 


Ovarian carcinoma 


ARHI 


ras homolog gene family, member I 


lp31 


Ovarian cancer 


ARP 




3p21.1 


Pancreatic cancer 


AXIN2 


axin 2 (conductin, axil) 


17q23-q24 


Colorectal cancer 


BAX 


BCL2 -associated X protein 


19ql3.3-ql3.4 


Colorectal cancer 

T-cell acute lymphoblastic leukemia 


BCPR 


homeo box B9 


17pl3.3 


Breast cancer 


BRCA1 


breast cancer 1 , early onset 


17q21 


Breast cancer- 1 
Ovarian cancer 
Breast-ovarian cancer 


BRCA2 


breast cancer 2, early onset 


13ql2.3 


Breast cancer 2, early onset 
Pancreatic cancer 


BRCA3 




llq23 


Breast cancer-3 


BRCA4 




13q21 


Breast cancer, type 4 


BRCAX 




13q21 


Breast cancer, type 4 


BRCD1 




13 


Breast cancer, ductal 


BRCD2 




lp36 


Breast cancer, ductal 


BUB1 


budding uninhibited by benzimidazoles 1 
(yeast homolog) 


2ql4 


Colorectal cancer with chromosomal 
instability 


CDH1 


cadherin 1, type 1, E-cadherin (epithelial) 


16q22.1 


Endometrial carcinoma 
Ovarian carcinoma 
Breast cancer, 


CLD 


congenital chloride diarrhea 


7q22-q31.1 


Colon cancer 

Chloride diarrhea, congenital, Finnish type, 


CSF1R 


colony stimulating factor 1 receptor, formerly 
McDonough feline sarcoma viral (v-fms) 
oncogene homolog 


5q33.2-q33.3 


Myeloid malignancy, predisposition to 


CTNNB1 


catenin (cadherin-associated protein), beta 1 
(88kD) 


3p22-p21.3 


Colorectal cancer 

Hepatoblastoma 

Pilomatricoma 


CYLD 


cylindromatosis (turban tumor syndrome) 


16ql2-ql3 


Cylindromatosis, familial 


DCC 


deleted in colorectal carcinoma 


18q21.3 


Colorectal cancer 


DEK 


DEK oncogene (DNA binding) 


6p23 


Leukemia, acute nonlymphocytic 


DLEC1 


deleted in lung and esophageal cancer 1 


3p22-p21.3 


Lung cancer 
Esophageal cancer 


DMBT1 


deleted in malignant brain tumors 1 


10q25.3-q26.1 


Glioblastoma multiforme 
Medulloblastoma 


DRA 


down-regulated in adenoma 


7q22-q31.1 


Colon cancer 

Chloride diarrhea, congenital, Finnish type, 


ELAC2 


elaC (E. coli) homolog 2 


17p 


Prostate cancer, susceptibility to 


EP300 


El A binding protein p300 


22ql3 


Colorectal cancer 


ESR1 


estrogen receptor 1 


6q25.1 


Breast cancer 
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Estrogen resistance 


ETV6 


ets variant gene 6 (TEL oncogene) 


12pl3 


Leukemia, acute lymphoblastic 


FSHR 


follicle stimulating hormone receptor 


2p21-pl6 


Premature ovarian failure 
Ovarian sex cord tumors 


HNPCC7 


3346 


15q21.1 


Colorectal cancer, hereditary nonpolyposis, 
type 7 


HPC1 


hereditary prostate cancer 1 


Iq24-q25 


Prostate cancer, susceptibility to 


HPCX 


hereditary prostate cancer, X-linked 


Xq27-q28 


Prostate cancer, susceptibility to 


HRAS 


v-Ha-ras Harvey rat sarcoma viral oncogene 
homo log 


llpl5.5 


Bladder cancer 


HRPT2 


hyperparathyroidism 2 (with jaw tumor) 


Iq25-q31 


Hyperparathyroidism-]' aw tumor syndrome 
Hyperparathyroidism, 


KAI1 


kangai 1 (suppression of tumorigenicity 6, 
prostate; CD82 antigen (R2 leukocyte 
antigen, antigen detected by monoclonal and 
antibody IA4)) 


llpll.2 


Prostate cancer, susceptibility to 


KIT 


v-kit Hardy-Zuckerman 4 feline sarcoma viral 
oncogene homolog 


4ql2 


Piebaldism 

Mast cell leukemia 

Mastocytosis with associated 


KRAS1P 


v-Ki-rasl Kirsten rat sarcoma 1 viral 
oncogene homolog, processed pseudogene 


12pl2.1 


Colorectal adenoma 
Colorectal cancer 


KRAS2 


v-Ki-ras2 Kirsten rat sarcoma 2 viral 
oncogene homolog 


12pl2.1 


Colorectal adenoma 
Colorectal cancer 


LCFS2 


mitochondrial ribosomal protein LI 3 


18qll-ql2 


?Lynch cancer family syndrome II 


LCO 


liver cancer oncogene 


2ql4-q21 


Hepatocellular carcinoma 


MADH4 


MAD (mothers against decapentaplegic, 
Drosophila) homolog 4 


18q21.1 


Pancreatic cancer 
Polyposis, juvenile intestinal 


MCC 


mutated in colorectal cancers 


5q21 


Colorectal cancer 


MERTK 


c-mer proto-oncogene tyrosine kinase 


2ql4.1 


Retinitis pigmentosa, MERTK-related 


MET 


met proto-oncogene (hepatocyte growth 
factor receptor) 


7q31 


Renal cell carcinoma, papillary, familial and 
sporadic 


MGCT 




12q22 


Male germ cell tumor 


MLH1 


mutL (E. coli) homolog 1 (colon cancer, 
nonpolyposis type 2) 


3p2L3 


Colorectal cancer, hereditary nonpolyposis, 
type 2 


MPL 


myeloproliferative leukemia virus oncogene 


lp34 ! 


Thrombocytopenia, congenital 
amegakaryocytic 


MSH2 


mutS (E. coli) homolog 2 (colon cancer, 
nonpolyposis type 1 ) 


2p22-p21 


Colorectal cancer, hereditary nonpolyposis, 
type 1 


MSH6 


mutS (E. coli) homolog 6 


2pl6 


Cancer susceptibility 
Endometrial carcinoma 
Colorectal 


MTACR1 


multiple tumor-associated chromosome 
region 1 


Upl5.5 


Wilms tumor, type 2 

Adrenocortical carcinoma, hereditary, 202300 


MYC 


v-myc avian myelocytomatosis viral 
oncogene homolog 


8q24.12-q24.13 


Burkitt lymphoma 


NRAS 


neuroblastoma RAS viral (v-ras) oncogene 
homolog 


lpl3.2 


Colorectal cancer 


PCAP 


predisposing for prostate cancer 


Iq42.2-q43 


Prostate cancer, susceptibility to 


PCBC 


3475 


lp36 


Prostate cancer, susceptibility to 
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PDGFB 


platelet-derived growth factor beta 
polypeptide (simian sarcoma viral (v-sis) 
oncogene homolog) 


22ql2.3-ql3.1 


Meningioma, SIS-related 
Dermatofibrosarcoma protuberans 


PDGFRL 


platelet-derived growth factor receptor-like 


8p22-p21.3 


Hepatocellular cancer 
Colorectal cancer 


PGL2 


paraganglioma or familial glomus tumors 2 


llql3.1 


Paraganglioma, familial nonchromaffin 


PGL3 


paraganglioma or familial glomus tumors 3 


lq21 


/.Paragangliomas, familial nonchromaffin, 3 


PHB 


prohibitin 


17q21 


Breast cancer, sporadic 


PEK3CA 


phosphoinositide-3 -kinase, catalytic, alpha 
polypeptide 


3q26.3 


Ovarian cancer 


PMS1 


postmeiotic segregation increased (S. 
cerevisiae) 1 


2q31-q33 


Colorectal cancer, hereditary nonpolyposis, 
type 3 


PMS2 


postmeiotic segregation increased (S. 
cerevisiae) 2 


7p22 


Turcot syndrome with glioblastoma 
Colorectal cancer, 


PPP2R1B 


protein phosphatase 2 (formerly 2A), 
regulatory subunit A (PR 65), beta isoform 


Ilq22-q24 


Lung cancer 


PRCA1 


prostate cancer 1 


Iq24-q25 


Prostate cancer, susceptibility to 


PRKCA 


protein kinase C, alpha 


17q22-q23.2 


Pituitary tumor, invasive 


PTEN 


phosphatase and tensin homolog (mutated in 
multiple advanced cancers 1 ) 


10q23.3 


Cowden disease 
Lhermitte-Duclos syndrome 


PTPN12 


protein tyrosine phosphatase, non-receptor 
type 12 


7ql 1.23 


Colon cancer 


RAB27A 


RAB27A, member RAS oncogene family 


15q21 


Griscelli syndrome 


RAD51 


RAD51 (S. cerevisiae) homolog (E coli RecA 
homolog) 


15ql5.1 


Breast cancer, susceptibility to 


RAD54L 


RAD54 (S.cerevisiae)-like 


lp32 


Lymphoma, non-Hodgkin 

Breast cancer, invasive intraductal 


RBI 


retinoblastoma 1 (including osteosarcoma) 


13ql4.1-ql4.2 


Retinoblastoma 
Osteosarcoma 
Bladder cancer, 


RET 


ret proto-oncogene (multiple endocrine 
neoplasia and medullary thyroid carcinoma 1 , 
Hirschsprung disease) 


10qll.2 


Multiple endocrine neoplasia IIA 
Medullary thyroid 


RUNX1 


runt-related transcription factor 1 (acute 
myeloid leukemia 1; amll oncogene) 


21q22.3 


Leukemia, acute myeloid 
Platelet disorder, familial, with 


SCLC1 


354 


3p23-p21 


Small-cell cancer of lung 


SLC22A1L 


solute carrier family 22 (organic cation 
transporter), member 1 -like 


llpl5.5 


Breast cancer 

Rhabdomyosarcoma 

Lung 


SLC26A3 


solute carrier family 26 , member 3 


7q22-q31.1 


Colon cancer 

Chloride diarrhea, congenital, Finnish type 


SMARCB1 


SWI/SNF related, matrix associated, actin 
dependent regulator of chromatin, subfamily 
b, member 1 


22qll 


Rhabdoid tumors 

Rhabdoid predisposition syndrome, familial 


SRC 


v-src avian sarcoma (Schmidt-Ruppin A-2) 
viral oncogene homolog 


20ql2-ql3 


Colon cancer, advanced 


SSTR2 


somatostatin receptor 2 


17q24 


Lung cancer, small cell 


ST11 


suppression of tumorigenicity 1 1 (pancreas) 


3p25 


Pancreatic endocrine tumors 


ST12 


suppression of tumorigenicity 12 (prostate) 


lOpter-ql 1 


Prostate adenocarcinoma 1 
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ST3 


suppression of tumorigenicity 3 


llql3 


Cervical carcinoma 


ST8 


suppression of tumorigenicity 8 (ovarian) 


6q26-q27 


Ovarian cancer, serous 


TACSTD2 


tumor-associated calcium signal transducer 2 


Ip32-ql2 


Corneal dystrophy, gelatinous drop-like 


TCF7L2 


transcription factor 7-like 2 (T-cell specific, 
HMG-box) 


10q25.3 


Colorectal cancer 


TGFBR2 


transforming growth factor, beta receptor II 
(70-80kD) 


3p22 


Colon cancer 

Colorectal cancer, hereditary nonpolyposis, 
type 6 


THPO 


thrombopoietin (myeloproliferative leukemia 
virus oncogene ligand, megakaryocyte growth 
and development factor) 


3q26.3-q27 


Thrombocythemia, essential 


TNFRSF10B 


tumor necrosis factor receptor superfamily, 
member 1 Ob 


8p22-p21 


Squamous cell carcinoma, head and neck 


TNFRSF11A 


tumor necrosis factor receptor superfamily, 
member 11a, activator of NFKB 


18q22.1 


Osteolysis, familial expansile 
Paget disease of bone, 


TNFRSF1A 


tumor necrosis factor receptor superfamily, 
member 1A 


12pl3.2 


Periodic fever, familial 


TNFRSF6 


tumor necrosis factor receptor superfamily, 
member 6 


10q24.1 


Autoimmune lymphoproliferative syndrome 


TNFSF5 


tumor necrosis factor (ligand) superfamily, 
member 5 (hyper-IgM syndrome) 


Xq26 


Immunodeficiency, X-linked, with hyper-IgM 


TNFSF6 


tumor necrosis factor (ligand) superfamily, 
member 6 


lq23 


Systemic lupus erythematosus, susceptibility 
to 


TNF 


tumor necrosis factor (TNF superfamily, 
member 2) 


6p21.3 


Malaria, cerebral, susceptibility to 
Septic shock 


TOC 


tylosis with oesophageal cancer 


17q24 


Tylosis with esophageal cancer 


TP53 


tumor protein p53 (Li-Fraumeni syndrome) 


17pl3.1 


Colorectal cancer 
Li-Fraumeni syndrome 


TP73 


tumor protein p73 


lp36 


Neuroblastoma 


TSG101 


tumor susceptibility gene 101 


Ilpl5.2-pl5.1 


Breast cancer 


VMGLOM 


venous malformation with glomus cells 


Ip22-p21 


Glomus tumors, multiple 


WT1 


Wilms tumor 1 


llpl3 


Wilms tumor, type 1 
Denys-Drash syndrome 
Frasier 


WT2 


Wilms tumor 2 


llpIS.5 


Wilms tumor, type 2 
Adrenocortical carcinoma, hereditary 



In another example, the at least one protein of interest includes a 
cytokine. Many diseases, including neurodegenerative (e.g., Alzheimer's 
disease) and autoimmune (e.g., rheumatoid arthritis, multiple sclerosis and the 
like) diseases are caused or accompanied by inflammation, resulting in 
infiltration of leukocytes into the inflicted tissue(s). In these diseases 
proinflammatory cytokines and chemokines are believed to play a pivotal role 
in the attraction of leukocytes to the site of inflammation and in the initiation 
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and progression of the inflammatory process. In rheumatoid arthritis, for 
example, the role of proinflammatory cytokines, particularly TNF-a and IL-1, 
in disease manifestation has been intensively studied and explored in 
experimental models that have been expanded to clinical trials. Other 

5 cytokines such as IL-4, TGF-p, IL-8, IL-17, IL-10, IL-11, IL-12 and IL-15 
have also been implicated in the regulation of rheumatoid arthritis. Such 
regulation can be attributed to either their direct effect on disease 
manifestation, their synergistic effect with other proinflammatory 
cytokines/chemokines, or their involvement in the regulation of chemokine 

10 transcription, and production. 

Chemokines are chemoattractant cytokines that mediate leukocyte 
attraction and recruitment at the site of inflammation. Based on the positions 
of the first two cysteines, chemokines can be divided into four highly 
conserved but distinct supergene families, C-C, C-X-C, C and C-X3-C. The 

15 C-C family is primarily involved in the activation of endothelium and 
chemoattraction of T cells and monocytes to the site of inflammation. The 
protective competence of anti-C-C chemokine based immunotherapy has been 
demonstrated in experimental autoimmune encephalomyelitis (EAE), and 
rheumatoid arthritis. 

20 Neutralizing the activity of chemokines as a way to treat various 

diseases has been explored by many researchers. For example, in a recent 
study neutralizing antibodies to epithelial neutrophil activating peptide 78 
(ENA-78) were found capable of inhibiting the development of AA if 
administered before but not after the onset of disease [92]. In another recent 

25 study, Barnes et al. [93] used anti-human RANTES to ameliorate AA in the 
Lewis rat. Gong et aL [94] used an antagonist of Monocyte Chemoattractant 
Protein 1 (MCP-1) to inhibit arthritis in the MRL-lpr mouse model. Using a 
streptococcal cell wall induced arthritis model it has been shown that anti-IL-4 
and anti MCP-1 antibodies block the disease [95]. The same study 

30 demonstrated that neutralizing IL-4 by itself, leads to a marked reduction in 
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MCP-1 mRNA transcription at the autoimmune site and to inhibition of the 
development of disease which further implicates MCP-1 in playing an active 
role in arthritis development. 

In yet another example, the at least one protein of interest includes a 
5 protein, e.g., a surface protein, of a pathogen, such as a viral pathogen, a 
bacterial pathogen or a parasite (either mono or multicellular parasite). 

The major histocompatibility complex (MHC) is a complex of antigens 
encoded by a group of linked loci, which are collectively termed H-2 in the 
mouse and HLA in humans. The two principal classes of the MHC antigens, 

10 class I and class II, each comprise a set of cell surface glycoproteins which 
play a role in determining tissue type and transplant compatibility. In 
transplantation reactions, cytotoxic T-cells (CTLs) respond mainly against 
foreign class I glycoproteins, while helper T-cells respond mainly against 
foreign class II glycoproteins. 

15 Major histocompatibility complex (MHC) class I molecules are 

expressed on the surface of nearly all cells. These molecules function in 
presenting peptides which are mainly derived from endogenously synthesized 
proteins to CD8+ T cells via an interaction with the ap T-cell receptor. The 
class I MHC molecule is a heterodimer composed of a 46-kDa heavy chain 

20 which is non-covalently associated with the 12-kDa light chain p-2 
microglobulin. Class I MHC-restricted peptides, which are traditionally 
assumed to be 8-10-amino acid-long, bind to the heavy chain al-a2 groove via 
two or three anchor residues that interact with corresponding binding pockets 
in the MHC molecule. The p-2 microglobulin chain plays an important role in 

25 MHC class I intracellular transport, peptide binding, and conformational 
stability [76]. For most class I molecules, the formation of a heterodimer 
consisting of the MHC class I heavy chain, peptide (self or antigenic) and p-2 
microglobulin is required for biosynthetic maturation and cell-surface 
expression [76]. 
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Research studies performed on peptide binding to class I MHC 
molecules enable to define specific MHC motifs functional in displaying 
peptides derived from viral or tumor antigens that are potentially immunogenic 
and might elicit specific response from cytotoxic T lymphocytes (CTLs) 
[77,78]. 

Soluble MHC multimers posses a high avidity for T-cells since they 
provide multi-point binding of TCRs with their MHC-peptide ligands. As 
such, multimeric forms (tetramers) of MHC-peptide complexes have been the 
center of much interest recently, because they can be used for direct 
phenotypic characterization of T cell responses in normal as well as 
pathological conditions, thus, providing insight into the pathopysiology and 
mechanisms of various diseases. Recombinant soluble and secreted MHC 
class I and class II complexes including single chain MHC are described in 
[79-91] which are incorporated herein by references. 

There are several thousands of MHC genes, some of which were 
cloned. Table 5 below associates the MHC genes into classes and types (6). 
The sequences of the known MHC genes can be found in the Kabat database 
( http://immuno.bme.nwu.edu/ ). 



TABLE 5 





Type 


Number of genes 


MHC Class I 


A,B,C 


1014 


MHC class II A chain 


DR DQ DP 


348 


MHC class II B chain 


DR DQ DP 


1680 



Genes encoding MHC of particular haplotypes can be readily isolated 
using techniques well known in the art and reconstructed to encode soluble 
MHC molecules essentially as exemplified in the Examples section that 
follows. Such well known techniques include, for example, PCR 
amplification, enzymatic digestion and ligation. 
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According to a presently preferred embodiment of the present invention 
analyzing the peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype is by mass spectrometry, mass charge 
ratio and collision induced disintegration. Edman degradation can also be 
5 employed in certain cases where a sufficient amount of the pure peptide 
becomes available. 

The identification of the amino acid sequence of a peptide in 
accordance with the teachings of the present invention is preferably effected 
by comparison of the data collected by mass spectrometry, mass charge ratio 

10 and collision induced disintegration to putative data of mass spectrometry, 
mass charge ratio and collision induced disintegration of known proteins. 

As used herein in the specification and in the claims section below the 
term "peptide" includes native peptides (either degradation products or 
synthetically synthesized peptides) and further to peptidomimetics, such as 

is peptoids and semipeptoids which are peptide analogs, which may have, for 
example, modifications rendering the peptides more stable while in a body, or 
more immunogenic. 

Such modifications include, but are not limited to, cyclization, N 
terminus modification, C terminus modification, peptide bond modification, 

20 including, but not limited to, CH2-NH, CH2-S, CH2-SO, 0=C-NH, CH2-O, 
CH2-CH2, S=C-NH, CH=CH or CF=CH, backbone modification and residue 
modification. Methods for preparing peptidomimetic compounds are well 
known in the art and are specified in Quantitative Drug Design, C.A. Ramsden 
Gd., Chapter 17.2, F. Choplin Pergamon Press (1992), which is incorporated 

25 by reference as if fully set forth herein. Further detail in this respect are 
provided hereinunder. 

As used herein in the specification and in the claims section below the 
term "amino acid" is understood to include the 20 naturally occurring amino 
acids; those amino acids often modified post-translationally in vivo, including 

30 for example hydroxyproline, phosphoserine and phosphothreonine; and other 
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unusual amino acids including, but not limited to, 2-aminoadipic acid, 
hydroxylysine, isodesmosine, nor-valine, nor-leucine and ornithine. 
Furthermore, the term "amino acid" includes both D- and L-amino acids. 
Further elaboration of the possible amino acids usable according to the present 
5 invention and examples of non-natural amino acids useful in MHC class I, type 
A2, recognizable peptide antigens are given hereinunder. Other anchor 
residues are known for other MHC molecules. 

Thus, assume the following positions (P1-P9) in a 9-mer peptide: 
P 1 -P2-P3-P4-P5-P6-P7-P8-P9 

10 The P2 and P9 positions include the anchor residues which are the main 

residues participating in binding to A2 MHC molecules. Amino acid resides 
engaging positions P2 and P9 are hydrophilic aliphatic non-charged natural 
amino (examples being Ala, Val, Leu, He, Gin, Thr, Ser, Cys, preferably Val 
and Leu) or of a non-natural hydrophilic aliphatic non-charged amino acid 

15 (examples being norleucine (Nle), norvaline (Nva), a-aminobutyric acid). 
Positions PI and P3 are also known to include amino acid residues which 
participate or assist in binding to MHC molecules, however, these positions 
can include any amino acids, natural or non-natural. The other positions are 
engaged by amino acid residues which typically do not participate in binding, 

20 rather these amino acids are presented to the immune cells. Further details 
relating to the binding of peptides to MHC molecules can be found in 
reference 1 17, see Table V thereof, in particular. 

Hydrophilic aliphatic natural amino acids at P2 and P9 can be 
substituted by synthetic amino acids, preferably Nleu, Nval and/or a 

25 -aminobutyric acid. P9 can be also substituted by aliphatic amino acids 
of the general formula -HN(CH2)nCOOH, wherein n = 3-5, as well as by 
branched derivatives thereof, such as, but not limited to, 
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-NH(CH2) n -COON 

I 
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R 

wherein R is, for example, methyl, ethyl or propyl, located at any one or more 
of the n carbons. 

The amino terminal residue (position PI) can be substituted by 
5 positively charged aliphatic carboxylic acids, such as, but not limited to, 
H 2 N(CH 2 ) n COOH, wherein n = 2-4 and H 2 N-C(NH)-NH(CH 2 ) n COOH, 
wherein n = 2-3, as well as by hydroxy Lysine, N-methyl Lysine or ornithine 
(Orn). Additionally, the amino terminal residue can be substituted by enlarged 
aromatic residues, such as, but not limited to, H 2 N-(C6H6>CH 2 -COOH, 

10 p-aminophenyl * alanine, H 2 N-F(NH)-NH-(C6H6)-CH 2 -COOH, 
p-guanidinophenyl alanine or pyridinoalanine (Pal). These latter residues may 
form hydrogen bonding with the OH" moieties of the Tyrosine residues at the 
MHC-1 N-terminal binding pocket, as well as to create, at the same time 
aromatic-aromatic interactions. 

15 Derivatization of amino acid residues at positions P4-P8, should these 

residues have a side-chain, such as, OH, SH or NH 2 , like Ser, Tyr, Lys, Cys or 
Orn, can be by alkyl, aryl, alkanoyl or aroyl. In addition, OH groups at these 
positions may also be derivatized by phosphorylation and/or glycosylation. 
These derivatizations have been shown in some cases to enhance the binding 

20 to the T cell receptor. 

Longer derivatives in which the second anchor amino acid is at position 
P10 may include at P9 most L amino acids. In some cases shorter derivatives 
are also applicable, in which the C terminal acid serves as the second anchor 
residue. 

25 Cyclic amino acid derivatives can engage position P4-P8, preferably 

positions P6 and P7. Cyclization can be obtained through amide bond 
formation, e.g., by incorporating Glu, Asp, Lys, Om, di-amino butyric (Dab) 
acid, di-aminopropionic (Dap) acid at various positions in the chain (-CO-NH 
or -NH-CO bonds). Backbone to backbone cyclization can also be obtained 

30 through incorporation of modified amino acids of the formulas 



H-N((CH 2 ) n -COOH)-C(R)H-COOH or H-N((CH 2 ) n -COOH)-C(R)H-NH 2 , 
wherein n = 1-4, and further wherein R is any natural or non-natural side chain 
of an amino acid. As stated above, the data presented herein relates to the 
residues of the most abandoned MHC molecule - MHC class I, type A2. This 
5 data was collected over the years via the detailed analysis of thousands of 
peptides that bind to MHC-I, A2. It will be appreciated that the method of the 
present invention allows the collection of data and analysis of peptides that 
bind any other to MHC molecule. 

Cyclization via formation of S-S bonds through incorporation of two 

10 Cys residues is also possible. Additional side-chain to side chain cyclization 
can be obtained via formation of an interaction bond of the formula 
-(-CH2-) n -S-CH2-C-, wherein n = 1 or 2, which is possible, for example, 
through incorporation of Cys or homoCys and reaction of its free SH group 
with, e.g., bromoacetylated Lys, Om, Dab or Dap. 

15 Peptide bonds (-CO-NH-) within the peptide may be substituted by 

N-methylated bonds (-N(CH 3 )-CO-), ester bonds (-C(R)H-C-0-0-C(R)-N-), 
ketomethylen bonds (-CO-CH2-), a-aza bonds (-NH-N(R)-CO-), wherein R is 
any alkyl, e.g., methyl, carba bonds (-CH2-NH-), hydroxyethylene bonds 
(-CH(OH)-CH2-) 5 thioamide bonds (-CS-NH-), olefinic double bonds 

20 (-CH=CH-), retro amide bonds (-NH-CO-), peptide derivatives 
(-N(R)-CH 2 -CO-), wherein R is the "normal" side chain, naturally presented 
on the carbon atom. 

These modifications can occur at any of the bonds along the peptide 
chain and even at several (2-3) at the same time. Preferably, but not in all 

25 cases necessary, these modifications should exclude anchor amino acids. 

Natural aromatic amino acids, Trp, Tyr and Phe, may be substituted for 
synthetic non-natural acid such as TIC, naphthylelanine (Nol), ring-methylated 
derivatives of Phe, halogenated derivatives of Phe or o-methyl-Tyr. 

Tables 6-7 below list all of the naturally occurring amino acids (Table 

30 6) and some of the non-conventional or modified amino acids (Table 7). 
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TABLE 6 



Amino Acid 


Three-Letter Abbreviation 


One-letter Symbol 


Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic acid 


Asp 


D 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glutamic Acid 


Glu 


E 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


lie 


I 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 


Any amino acid as above 


Xaa 


X 


Table 7 



Non-conventional amino acid 


Code 


Non-conventional amino acid 


Code 


a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a- amino - a-methy lbu tyr ate 


Mgabu 


L-N-memylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


arninonorbornyl- 


Norb 


L-N-methylglutamine 


Nmgin 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohexylalanine 


Chexa 


L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 
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D-alanine 


Dal 


T -N-mpthvllpupinp 

1—/ 1> illVLllj' UwUvUlC 


NtyiIpii 

1.^1 1111 Vkl 


D-arcinine 




T -N-rnpthvllvcirip 

-L^ J. ^ U1C HI jr 11 OlJ.lV 


l^liUjf o 


D-aspartic acid 


J) asp 


T .-N-methvltnethinnine 


TsJmtnpt 

1 ^1 1 1 11 11 V I 


D-cysteine 


Dcys 


T ,,-N-methvlnnrlpiicinp 


Nmnlp 

1 > 1111 11 V 


D-glutamine 


Dgln 


I ^-N-methvlnorvaline 


Nmnva 

x ^ iiiii v a 


D -glutamic acid 


Delu 


T v-N-Tnethvlornithine 


Nmom 

lllllV 1 ! 11 


D-histidine 


Dhis 


T -N-mpthvlnhprivlalaniTiP 

J_/ 1 ^ iliVUljf 1 IJUVllj' ICllOJ LlUv 


1>111IJ11V 


D-isoleucine 


Dile 


T -N-mpthvlnrnlinp 


Nmnrn 

l^tlllLll \J 


D-leucine 


Dleu 


T -Nf-mptfiv1<jpt*inp 

1— < l^t lllv LLlj 111V 


1~ llloVl 


T")-1v^inp 




j-j— -ine my i ull vuiiiiiv 


1> 111 LI 11 


D-tnethionine 

X-^ lllv H 1 1 V</ Mill IV 


Dmet 


T -M-TTiftVivl tT*vr\tr\'nh An 
±_/~l ^1 "Hlv Llljr 1 U J' JJ i v jjiicui 


i^iiiup 


D-orni th in e 


Dorn 


T -^^-mf tlrvl f~VT*OQi np 

L 1 — 111VUJ. YlLjrl wo 11 IV 




D-phenylalanine 


Dphe 


T -N-rnpthvl vfilinp 
j — i i uiv ui y i v aiiiiv 


1 'I 111 V dl 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


D -threonine 


Dthr 


L-norleucine 


Nle 


D-tryptophan 


Dtrp 


L-norvaline 


Nva 


J-V Lyi UoJJlC 


j-Jiyr 


a-methyl-aminoisobutyrate 


Maib 


T~^_t ro lino 

j-/- Valine 


uvai 


a-methyl-y-aminobutyrate 


Mgabu 


D-a-methylalanine 


umaia 


a-methylcyclohexylalanine 


Mchexa 


D-a-methylarginine 


-L/marg 


a-methylcyclopentylalanine 


Mcpen 


D-Ct-methylasparagine 


L/mdsn 


a-methyl-a-napthylalanine 


Ma.ii a p 


D-a-methylaspartate 


umasp 


a-methylpeniciUamine 


Mpen 


D-a-methylcysteine 


Dmcys 


N- (4 - aminobutyl) glyc ine 


FMglU 


D-a-methylglutamine 


Dmgln 


N- (2 - amino ethy l)glyc ine 


Naeg 


D-a-methylhistidine 


Dmhis 


N-(3 -aminopropyl)glycine 


INorn 


D-Ot-methylisoleucine 


Dnule 


N- amino-a-methylbutyrate 


Nmaabu 


D-a-methylleucine 


Dmleu 


a-napthylalanine 


Anap 


D-OC-methyllysine 


Dmlys 


N-b enzylgly c ine 


iNpne 


D-OC-methylmethionine 


Ommpt 

-L/llUllC'l 


i> ^z. _ vcu uainy i v my i jQiy viiic 


i>gin 


D-a-methylorni thine j 


Dmorn 


N-(carbamylmethyl)glycine 


Nasn 


D-a-methylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


D-a-methylproline 


Dmpro 


N-(carboxymethyl)glycine 


Nasp 
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D-a-methylserine 


Dixiser 


iN-cyuiouuiyigiycine 


1NCDUI 


D-a-methylthreonine 


J-JIUUU 


iN^cyuioncpi^igiyciiic 


iNcnep 


D-a-methyltryptophan 


Dixitrp 


l^^cyUxUIlwt y IgiyciIxC 


iNcnex 


D-a-methyltyrosine 


T^mtv 
LJiiiiy 


rN -oyciuuecyigiycine 


in c dec 


D-a-methylvaline 


T*)m\/sil 

LJlii Veil 


in -cycivj uuu.ee igiyt/ine 


^T/"» n f\ tA 

INCQOU 


D-a-methylalnine 


J_-/ 1 III la la 


iN-eyciuuLiyigiycinc 


IN COCl 


D-a-methylarginine 


j-/niiiarg 


in -cyciopropyigiycine 


Ncpro 


D-a-methylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 


D-a-methylasparatate 


Dnmasp 


N-(2,2-diphenylethyl)glycine 


Nbhm 


D-a-methylcysteine 


Dnmcys 


N-(3 ,3 -diphenylpropyl)glycine 


Nbhe 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl) glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-rnethvl-V-aminobutvrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 

X iJlllvilvAU 


D -N-m e th vl m e th i on i n e 


Dnmmet 

L/llli 1 1 1 ly I 


D-N-methylorni thine 


Dnmom 

J—*' xxxxxv/x xx 


N-m eth vl eve 1 nnen tvl al an i n p 




N-me th vl t? 1 vc ine 


Nala 


T")-N-m pth vl "nfi pn vl 1 a n i n p 


T^intnnlif* 


N-methvlaminoi sohutvrate 


Nrnaih 


T"")-.N-m pth vl nrn 1 i n p 
A-/ i ^ i-llt/ uiy vj iiiic- 




N-f 1 -methvloroovnff lvcine 


Nile 


T^-TsI-mpt Vi vl ^pti'tip 


Dnmcpr 


N-f2-iiiethvloroDvlWlvcine 


Nile 


X) -N-m pth vl <5pti* n p 




N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethyl)glycine 


Nva 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-t-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


j^-nornopnenyiaianine 


ripne 


L-a-methylalanine 


Mala 


L-a-methylarginine 


Marg 


L-a-methylasparagine 


jviasn 


L-a-methylaspartate 


iviasp 


L-OC-methyl-t-butylglycine 


Mtbug 


L-a-methylcysteine 


ivicys 


L-metbylethylglycine 


Metg 


L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomo phenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl)glycine 


Nmet 


D-N-methylglutamine 


Dnmgln 


N-(3 -guanidinopropyl)glycine 


Narg 



35 



D-N-methylglutamate 


Dnmglu 


N-( 1 -hydroxyethyl)glycine 


Nthr 


D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl)glycine 


Nser 


D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl)glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl)glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-niethyl-y-aniinobutyrate 


Nmgabu 


N-me thy lcyc lohexy 1 alanine 


Nmchexa 


D-N-methvl methionine 


Dnmmet 


D-N-methylo mithine 


Dnmom 


N-m e th vl c \/c 1 on en tvl al an i n e 


Niripnen 


N-methylglycine 


Nala 


D-N-methvlnhenvlalanine 


Dnmnhe 


N-methylaminoisobutyrate 


Nmaih 


D-N-methvlnroline 




N-( 1 -methylpropyl)glycine 


Nile 


T)-N-methvl serine 




N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethyl)glycine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napmylalanine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-t-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


T -h ri in rvnh en \/1 alanine 


rrpiic 


L-a-methylalanine 




L-a-methylarginine 


^/larg 


L-a-methylasparagine 


jviasn 


L-Cl-methylaspartate 


ivioop 


L-Ot-methyl-t-butylglycine 


IVllDUg 


L-(X-methylcysteine 




i_/-iiic my ic my igiy o uic 


N^etg 


L-a-methylglutamine 


N*Igln 


L-a-methylglutamate 


Mglu. 


L-CC-methyihistidine 


Mhis 


L-a-methylhomophenylalanine 


ivinpne 


L-a-methyiisoleucine 


ivine 


iN-^z-meinyiimoetnyi jgiycine 


\T mo i 

jNmei 


L-Ot-methylleucine 


iviieu 


L-Ct-methyllysine 


Mlys 


L-a-methylmethionine 


ivimeL 


L-a-methylnorleucine 


lvinie 


L-a-methylnorvaline 


IVillVd 


L-a-methylorni thine 


Morn 


L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 


L-a-methylserine 


mser 


L-a-methylthreonine 


Mthr 


L-a-methylvaline 


Mtrp 


L-a-methyltyrosine 


Mtyr 


L-a-methylleucine 


Mval j 
Nnbhm 


L-N-methylhomophenylalanine 


Nmhphe 


N-(N-(2,2-diphenylethyl) 




N-(N-(3 ,3-diphenylpropyl) 




carbamylmethyl-glycine 


Nnbhm 


carbamylmethyl( 1 )glycine 


Nnbhe 
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1 -carboxy- 1 -(2,2-diphenyl 


Nmbc 






ethylamino)cyclopropane 









A peptide according to the present invention can be used in a self 
standing form or be a part of a larger moiety such as a protein or a display 
moieties such as a display bacterium, a display phage or preferably a display 
5 cell. 

Additionally, a peptide according to the present invention includes at 
least five, optionally at least six, optionally at least seven, optionally at least 
eight, optionally at least nine, optionally at least ten, optionally at least eleven, 
optionally at least twelve, optionally at least thirteen, optionally at least 

10 fourteen, optionally at least fifteen, optionally at least sixteen or optionally at 
least seventeen, optionally between seventeen and twenty five or optionally 
between twenty five and at least thirty amino acid residues (also referred to 
herein interchangeably as amino acids). 

Accordingly, as used herein the term "amino acid" or "amino acids" is 

15 understood to include the 20 naturally occurring amino acids; those amino 
acids often modified post-translationally in vivo, including, for example, 
hydroxyproline, phosphoserine and phosphothreonine; and other unusual 
amino acids including, but not limited to, 2-aminoadipic acid, hydroxylysine, 
isodesmosine, nor-valine, nor-leucine and ornithine. Furthermore, the term 

20 "amino acid" includes both D- and L-amino acids. 

As used herein the phrase "derived from a protein" refers to peptides 
derived from the specified protein or proteins and further to homologous 
peptides derived from equivalent regions of proteins homologous to the 
specified proteins of the same or other species, provided that these peptides are 

25 effective as vaccines, such as anti-tumor vaccines. The term further relates to 
permissible amino acid alterations and peptidomimetics designed based on the 
amino acid sequence of the specified proteins or their homologous proteins. 
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As used herein the phrase M anti-tumor vaccines" refers to a vaccines 
effective in preventing the development of, or curing, cancer, including 
primary tumor and/or metastases. 

The peptides of the invention can be administered per se or as an active 
5 ingredient in a pharmaceutical composition which may further include a 
pharmaceutically acceptable carrier. Preferably, one or more peptides of the 
invention are presented in context of an antigen presenting cell. The most 
common cells used to load antigens are bone marrow and peripheral blood 
derived dendritic cells (DC), as these cells express costimulatory molecules 

10 that help activation of CTL. Nevertheless, the peptide presenting cell can also 
be a macrophage, a B cell or a fibroblast. Presenting the peptide can be 
effected by a variety of methods, such as, but not limited to, (a) transforming 
the presenting cell with at least one polynucleotide (e.g., DNA) encoding at 
least one peptide; (b) loading the presenting cell with at least one 

15 polynucleotide (e.g., RNA) encoding at least one peptide; (c) loading the 
presenting cell with at least one peptide (e.g., synthetic peptide); and (d) 
loading the antigen presenting cell with at least one longer polypeptide (e.g., 
purified) including at least one peptide. Loading can be external or internal. 
The polynucleotide, peptide or longer polypeptide can be fused to internalizing 

20 sequences, antennapedia sequences or toxoid sequences or to helper 
sequences, such as, but not limited to, heat shock protein sequences. 

As used herein a "pharmaceutical composition" refers to a preparation 
of one or more of the peptides described herein, with other chemical 
components such as pharmaceutically suitable carriers and excipients. The 

25 purpose of a pharmaceutical composition is to facilitate administration of a 
compound to a subject. 

Hereinafter, the term "pharmaceutically acceptable carrier" refers to a 
carrier or a diluent that does not cause significant irritation to a subject and 
does not abrogate the biological activity and properties of the administered 
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compound. Examples, without limitations, of carriers are propylene glycol, 
saline, emulsions and mixtures of organic solvents with water. 

Herein the term "excipient" refers to an inert substance added to a 
pharmaceutical composition to further facilitate administration of a compound. 
5 Examples, without limitation, of excipients include calcium carbonate, calcium 
phosphate, various sugars and types of starch, cellulose derivatives, gelatin, 
vegetable oils and polyethylene glycols. 

According to a preferred embodiment of the present invention, the 
pharmaceutical carrier is an aqueous solution of lactic acid. 

10 In this respect, it should be pointed out that some of the peptides of the 

present invention, according to preferred embodiments, are readily soluble in 
aqueous media and are therefore easily formulated. 

Techniques for formulation and administration of drugs may be found 
in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 

15 latest edition, which is incorporated herein by reference. 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, transdermal, intestinal or parenteral delivery, including 
intramuscular, subcutaneous and intramedullary injections as well as 
intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 

20 intraocular injections. 

Pharmaceutical compositions of the present invention may be 
manufactured by processes well known in the art, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. 

25 Pharmaceutical compositions for use in accordance with the present 

invention thus may be formulated in conventional manner using one or more 
pharmaceutically acceptable carriers comprising excipients and auxiliaries, 
which facilitate processing of the active compounds into preparations which 
can be used pharmaceutically. Proper formulation is dependent upon the route 

30 of administration chosen. 
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For injection, the peptides of the invention may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as 
Hank's solution, Ringer's solution, or physiological saline buffer with or 
without organic solvents such as propylene glycol, polyethylene glycol and the 
like. For transmucosal administration, penetrants are used in the formulation. 
Such penetrants are generally known in the art. 

For oral administration, the peptides can be formulated readily by 
combining the active compounds with pharmaceutically acceptable carriers 
well known in the art. Such carriers enable the peptides of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for oral ingestion by a patient. Pharmacological 
preparations for oral use can be made using a solid excipient, optionally 
grinding the resulting mixture, and processing the mixture of granules, after 
adding suitable auxiliaries if desired, to obtain tablets or dragee cores. 

Suitable excipients are, in particular, fillers such as sugars, including 
lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for 
example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carbomethylcellulose and/or physiologically acceptable polymers such as 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, 
such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used which may optionally contain gum 
arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium 
dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound 
doses. 
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Pharmaceutical compositions, which can be used orally, include 
push-fit capsules made of gelatin as well as soft, sealed capsules made of 
gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules 
may contain the active ingredients in admixture with filler such as lactose, 
binders such as starches, lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as aqueous solution, fatty oils, 
liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in dosages suitable 
for the chosen route of administration. 

For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the peptides are conveniently 
delivered in the form of an aerosol spray presentation from a pressurized pack 
or a nebulizer with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichloro-tetrafluoroethane 
or carbon dioxide. In the case of a pressurized aerosol, the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of, e.g., gelatin for use in an inhaler or insufflator may be 
formulated containing a powder mix of the compound and a suitable powder 
base such as lactose or starch. 

The peptides described herein may be formulated for parenteral 
administration, e.g., by bolus injection or continuos infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampoules or in 
multidose containers with optionally, an added preservative. The compositions 
may be suspensions, solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical compositions for parenteral administration include 
aqueous solutions of the active compound in water-soluble form. 
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Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles 
include fatty oils such as sesame oil, or synthetic fatty acids esters such as 
ethyl oleate, triglycerides or liposomes. Aqueous injection suspensions may 
contain substances, which increase the viscosity of the suspension, such as 
sodium carboxymethyl cellulose, sorbitol or dextran. Optionally, the 
suspension may also contain suitable stabilizers or agents which increase the 
solubility of the peptides to allow for the preparation of highly concentrated 
solutions. 

Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile, pyrogen-free water, before 
use. 

The peptides of the present invention may also be formulated in rectal 
compositions such as suppositories or retention enemas, using, e.g., 
conventional suppository bases such as cocoa butter or other glycerides. 

The pharmaceutical compositions herein described may also comprise 
suitable solid of gel phase carriers or excipients. Examples of such carriers or 
excipients include, but are not limited to, calcium carbonate, calcium 
phosphate, various sugars, starches, cellulose derivatives, gelatin and polymers 
such as polyethylene glycols. 

Pharmaceutical compositions suitable for use in context of the present 
invention include compositions wherein the active ingredients are contained in 
an amount effective to achieve the intended purpose. More specifically, a 
therapeutically effective amount means an amount of peptide effective to 
prevent, alleviate or ameliorate symptoms of pathology or prolong the survival 
of the subject being treated. 

Determination of a therapeutically effective amount is well within the 
capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. 
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For any peptide used in the methods of the invention, the therapeutically 
effective amount or dose can be estimated initially from activity assays in cell 
cultures and/or animals. For example, a dose can be formulated in animal 
models to achieve a circulating concentration range that includes the IC50 as 
5 determined by activity assays (e.g., the concentration of the test compound, 
which achieves a half-maximal inhibition of the proliferation activity). Such 
information can be used to more accurately determine useful doses in humans. 

Toxicity and therapeutic efficacy of the peptides described herein can 
be determined by standard pharmaceutical procedures in experimental animals, 
10 e.g., by determining the IC50 and the LD50 (lethal dose causing death in 50 % 
of the tested animals) for a subject compound. The data obtained from these 
activity assays and animal studies can be used in formulating a range of dosage 
for use in human. 

The dosage may vary depending upon the dosage form employed and 
15 the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of 
the patient's condition. (See e.g., Fingl, et al., 1975, in "The Pharmacological 
Basis of Therapeutics", Ch. 1 p.l). 

Dosage amount and interval may be adjusted individually to provide 
20 plasma levels of the active moiety which are sufficient to maintain therapeutic 
effects, termed the minimal effective concentration (MEC). The MEC will 
vary for each preparation, but can be estimated from in vitro and/or in vivo 
data, e.g., the concentration necessary to achieve 50-90 % inhibition of a 
proliferation of certain cells may be ascertained using the assays described 
25 herein. Dosages necessary to achieve the MEC will depend on individual 
characteristics and route of administration. HPLC assays or bioassays can be 
used to determine plasma concentrations. 

Dosage intervals can also be determined using the MEC value. 
Preparations should be administered using a regimen, which maintains plasma 
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levels above the MEC for 10-90 % of the time, preferable between 30-90 % 
and most preferably 50-90 %. 

Depending on the severity and responsiveness of the condition to be 
treated, dosing can also be a single administration of a slow release 
composition described hereinabove, with course of treatment lasting from 
several days to several weeks or until cure is effected or diminution of the 
disease state is achieved. 

The amount of a composition to be administered will, of course, be 
dependent on the subject being treated, the severity of the affliction, the 
manner of administration, the judgment of the prescribing physician, etc. 

Compositions of the present invention may, if desired, be presented in a 
pack or dispenser device, such as a U.S. Food and Drug Administration (FDA) 
approved kit, which may contain one or more unit dosage forms containing the 
active ingredient. The pack may, for example, comprise metal or plastic foil, 
such as a blister pack. The pack or dispenser device may be accompanied by 
instructions for administration. The pack or dispenser may also be 
accompanied by a notice associated with the container in a form prescribed by 
a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the 
form of the compositions or human or veterinary administration. Such notice, 
for example, may be of labeling approved by the U.S. Food and Drug 
Administration for prescription drugs or of an approved product insert. 
Compositions comprising a chemical conjugate of the invention formulated in 
a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition, such 
as a cancer of a certain type, an autoimmune disease or an allergy. 

Peptides of the present invention may be packaged in kits, each such kit 
comprising a plurality of individual containers, each of which containing at 
least one peptide identified by the method of the present invention. Such a kit 
can be used for two purposes. First, an in vitro functional assay, such as the 
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CTL assay [20] or the ELISPOT assay [19] of, for example, cytokine (e.g., 
IL-2, TNF alpha or interferon gamma) production or development of 
cytotoxiciy using immune cells derived from a patient can be used to 
determine the immune response of the patient to each one of the peptides, 
5 which response to a large extent depends on the particular MHC haplotype of 
the patient. Second, once, a reactive peptide or peptides are identified, either 
by an individualized in vitro assay or from in silico data as is further detailed 
below, suitable peptide or peptides from the kit can be administered so as to 
treat the patient. 

10 According to another aspect of the present invention there is provided 

an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the method described herein. Various 
other parameters, such as the parameters identified in the Tables provided in 
the Examples section that follows, can also be linked to the peptide sequences, 

15 in, for example, a table form or any other form. Preferably, the plurality of 
peptides are arranged at least according to their association with a pathology 
and further according to their ability of binding to MHC molecules of a 
particular haplotype. This in silico data can be used instead or in addition to 
the in vitro assays described above to match a most active peptide to treat a 

20 pathology of a certain patient having a particular pre identified MHC 
haplotype. Thus, look up tables associating a peptide with a protein with a 
gene, with a disease with a haplotype, and/or with an efficiency score can be 
constructed and used to best suit a peptide for treatment of a disease in an 
individualized way taking into account the MHC haplotype of the patient to be 

25 treated. Of course, individualized in vitro assays can be used to ascertain 
peptide selection. 

The electronic data storage device can, for example, be an 
electromagnetically or electro-optically readable device and it preferably forms 
a part of a server that is accessible by users through a communications 
30 network, such as the Internet, an intranet or an extranet, via a plurality of user 
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clients at the disposal of the users. A management software application 
manages the data stored in the data storage device and is preferably designed 
to support search and retrieval of information from the database and deposition 
of information into the database. 
5 Thus, further according to the present invention there is provided a 

method of eliciting an immune response against a protein of interest in a 
subject having a particular MHC haplotype. The method according to this 
aspect of the invention is effected by determining the subject's particular MHC 
haplotype; and administering to the subject an effective amount of at least one 

10 peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

Still further according to the present invention there is provided a 
method of eliciting an immune response against a protein of interest in a 
subject. The method is effected by using an individualized in vitro assay for 

15 determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
of interest and which is capable of inducing predetermined sufficient immune 
reactivity. 

20 According to another aspect the present invention provides a method of 

treating a pathology by eliciting an immune response against a protein of 
interest in a subject having a particular MHC haplotype. The method is 
effected by determining the subject's particular MHC haplotype; and 
administering to the subject a therapeutic effective amount of at least one 

25 peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

As used herein the term "treating" includes prevention or cure of a 
pathology, such as a disease, syndrom or manifestation, effected by inhibiting, 
slowing or reversing the progression of the disease, syndrom or manifestation, 

30 substantially ameliorating clinical symptoms of a disease, syndrom or 
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manifestation or substantially preventing the appearance of clinical symptoms 
of a disease, syndrom or manifestation. 

As used herein the term "subject" refers to humans and animals having 
an MHC system, such as the HLA system in humans, in particular farm 
animals. It will be appreciated in this respect that the method of the present 
invention can be used to improve all kinds of peptide immunization via 
individualization for both human beings and animals. 

A variety of pathologies can be treated using the peptides of the present 
invention, including, but not limited to, cancers, infections, inflammations, 
autoimmune diseases, allergies, etc. The gist of the present invention with 
respect to treating pathologies lies in the fact that the present invention offers, 
for the first time, individualization of the vaccine to the MHC haplotype of the 
treated subject. 

Additional objects, advantages, and novel features of the present 
invention will become apparent to one ordinarily skilled in the art upon 
examination of the following examples, which are not intended to be limiting. 
Additionally, each of the various embodiments and aspects of the present 
invention as delineated hereinabove and as claimed in the claims section below 
finds experimental support in the following examples. 

EXAMPLES 

Reference is now made to the following examples, which together with 
the above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures 
utilized in the present invention include molecular, biochemical, 
microbiological and recombinant DNA techniques. Such techniques are 
thoroughly explained in the literature. See, for example, "Molecular Cloning: 
A laboratory Manual" Sambrook et al., (1989); "Current Protocols in 
Molecular Biology" Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., 
"Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore, 



Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John 
Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", 
Scientific American Books, New York; Birren et al. (eds) "Genome Analysis: 
A Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory 
5 Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 
4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A 
Laboratory Handbook", Volumes I-III Cellis, J. E., ed. (1994); "Culture of 
Animal Cells - A Manual of Basic Technique" by Freshney, Wiley-Liss, N. Y. 
(1994), Third Edition; "Current Protocols in Immunology" Volumes I-III 

io Coligan J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical 
Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell 
and Shiigi (eds), "Selected Methods in Cellular Immunology", W. H. 
Freeman and Co., New York (1980); available immunoassays are extensively 
described in the patent and scientific literature, see, for example, U.S. Pat. 

15 Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 
3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 
4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. J., 
ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J., 
eds. (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., 

20 eds. (1984); "Animal Cell Culture" Freshney, R. I., ed. (1986); "Immobilized 
Cells and Enzymes" IRL Press, (1986); "A Practical Guide to Molecular 
Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317, 
Academic Press; "PCR Protocols: A Guide To Methods And Applications", 
Academic Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein 

25 Purification and Characterization - A Laboratory Course Manual" CSHL Press 
(1996); all of which are incorporated by reference as if fully set forth herein. 
Other general references are provided throughout this document. The 
procedures therein are believed to be well known in the art and are provided 
for the convenience of the reader. All the information contained therein is 

30 incorporated herein by reference. 
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MATERIALS AND EXPERIMENTAL PROCEDURES 
Cell lines: 

The human cancer cell lines: PC3 (prostate cancer), UCI-107 and 
5 UCI-101 (both ovarian cancer), MDA-231 and MCF-7 (both breast cancer) 
were obtained from the ATCC. The human B-cell line C1R was a generous 
gift from Nick Zavazava. UCI-107, UCI-101, MDA-231 and MCF-7 cells 
were maintained in DMEM containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml 
streptomycin and 100 units/ml penicillin. PC3 and C1R cells were maintained 

10 in RPMI 1640 containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml 
streptomycin and 100 units/ml penicillin. For growing MCF-7 cells without 
estrogen, the cells were maintained in DMEM without sodium pyruvate and 
phenol red and containing 4 % FCS stripped of estrogen, 1 mM glutamine, 0.1 
mg/ml streptomycin and 100 units/ml penicillin. Culture media, and serum 

15 were obtained from GibcoBRL. 

RMA-S-HHD is a murine TAP-2 deficient lymphoma clone of 
C57BL/6 origin, transfected with HLA-A2.1/Db-p2m single chain (HHD) 
[23]. The RMA-S-HHD-B7. 1 cells transfected by the murine B7.1 
costimulatory molecule (CD80). EL4-HHD is the murine lymphoma cell line 

20 EL4 transfected by HHD. RMA-S-HHD, RMA-S-HHD-B7 1 and EL4-HHD 
were maintained in RPMI 1640 containing 10 % FCS and 1 mM glutamine, 
0.1 mg/ml streptomycin and 100 units/ml penicillin. After transfection, the 
cells were maintained in medium supplemented with 500 to 1000 |ig/ml of the 
antibiotic G418 (GibcoBRL). 

25 DNA: 

Plasmid HLA-A2/Q10^ 5 used for expression of soluble MHC, contains 
the first five exons of the HLA-A2 fused to exons 5 to 8 of the murine mutant 
Q10b, which lacks a functional transmembrane domain and is therefore 
secreted. This plasmid was a generous gift from D. Margulies, of the NIH 

30 [24]. Plasmid (php2m) was constructed to express the human 
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p2-microglobulin. It is based on the cDNA of human (32m (hp2m) isolated 
from PC3 cells and amplified by PCR using the following primers: 5 '-sense 
primer: 5'-AGATTCCCAAGCTTATGTCTCGCTCCGTGG-3' (SEQ ID 
NO:40) contained a restriction site for Hind III before the signal peptide and a 
5 3 ' antisense primer 5'- AGCTAGTCTAGATTATCACATGTCTC 
G ATCCC ACTT AAC-3 1 (SEQ ID NO:41) contained the restriction site for 
Xbal on the 3 ' end of p2m. The purified PCR product was cut with Hindlll 
and Xbal and ligated into the eukaryotic expression vector pCDNA-3.1 
(Invitrogen). Plasmid sHLA-A2 and sHLA-B7 contains the cDNA of the first 

io 4 exons of this alleles ligated into the plasmid pcDNA3.1 [34]. 
Cloning of sHLA-Cw4 was as follows: 
cDNA corresponding to the extracellular part of polymorphic MHC class I 
molecules isolated from peripheral blood lymphocytes was PCR amplified 
using the following primers: a 5 '-sense primer: 

15 5 f -AAGCTTATGCTGGTCATGGCGCCCCGAACC-3 f (SEQ ID NO: 371) 
which contained a restriction site for Hindlll before the signal peptide and a 3' 
antisense primer: 5 f -GGATCCTTAGATATCGGGGACGGTGGAC 
TGGG AAG ACGGCTC-3 1 (SEQ ID NO:372), which contained the restriction 
site for BamHI on the 3' end upstream the transmembrane domain of 

20 sHLA-Cw4. PCR conditions included 35 cycles of: 92 °C for 1 minute, 55 °C 
for 2 minutes and 72 °C for 3 minutes. The purified reaction product was cut 
with Hindlll and BamHI and ligated into the eukaryotic expression vector 
pCDNA-3.1 (Invitrogen). Cloning was verified by transformation into XL1 
Blue E. coli cells followed by PCR amplification of the inserted gene, using 

25 the oligonucleotides identified by SEQ ID NOs: 371 and 372. This 
verification reaction was performed under the PCR conditions described 
above, except for the the initial denaturation step, which was at 95 °C for 30 
seconds. Plasmid DNA was isolated by a DNA isolation kit which uses 
gravity-flow anion exchange tips (Qiagen Inc. USA). Insert DNA was 
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sequence analyzed using T7 and Sp6 primers (Promega Corp. USA). 
Antibodies and hybridomas: 

The hybridomas W6/32 and BB7.2, an anti-MHC class-I and 

anti-HLA-A2 respectively, and HB-149 an anti p2m were obtained from the 
5 ATCC. The antibodies were affinity purified using protein A-Sepharose 

CL-4B (Sigma) from mouse ascites fluid. 

Transfection of cancer cells and selection of clones secreting sMHC: 
Cell lines were co-trans fected with plasmid HLA-A2/Ql()b and with 

php2m, which conferred resistance to the antibiotic G418 or transfected only 
10 with the plasmids sHLA-A2, sHLA-B7 and sHLA-Cw4 that contained the 

antibiotic resistance. Cells were electroporated by use of a Gene Pulser 

(Bio-Rad) set at 280-300 mV 960 juF. Transfected cell clones were selected in 

G418 antibiotic and screened for those secreting sMHC to the growth medium. 

Secretion of sMHC was assayed by sandwich-ELISA with plates coated with 
15 the mAb BB7.2 (for sMHC-A2) or by HB-149 (for sMHC-B7) and sMHC and 

specifically sMHC-Cw4 were detected with the biotinilated mAb W6/32. 

Color was developed with ABTS (Sigma) catalyzed by streptavidin peroxidase 

(Sigma). 

Affinity purification of soluble MHC: 

20 Cultured cells, expressing the soluble MHC were grown to confluency 

in 150 mm plates. The culture medium was collected and residual cells were 
removed by centrifugation. Soluble MHC molecules were purified from the 
cleared culture medium by affinity chromatography on W6/32 antibody 
columns at 4 °C. The antibodies were coupled to NHS-activated agarose 

25 (Pharmacia) or to protein A Sepharose (Sigma) with n-methylpipelimidate 
(Sigma). The columns were washed with 0.5 M NaCl, 20 mM, Tris pH 8. The 
MHC molecules were eluted from the column with 0.1 M acetic acid at pH 3. 
Peptides were separated from the MHC complexes by boiling for five minutes 
in 10 % acetic acid followed by ultra- filtration through a 3 kDa Microcon 

30 (Amincon) [2]. 



51 

Synthetic peptides: 

Peptides were synthesized on AbiMed AMS 422 multiple peptide 
synthesizer (Abimed, Langenfeld, Germany) by Fmoc chemistry, precipitated 
with ether and used with or without further purification (HPLC). 
5 Peptides separation and analysis: 

The MHC bound peptides were resolved by reverse-phase HPLC on a 
0.1 ID fused silica capillaries with length of about 30 cm (J&W) slurry packed 
with POROS 10 R2 (PerSepetive Biosystems). The capillaries were fitted with 
electrospray needle made from 36-gauge stainless tubing (Small Parts Inc. 

io Miami Lakes, FL). A Rheodyne 9125 HPLC injector fitted with a 20 \x\ loop 
was used for loading the column. The peptides were resolved by a relatively 
long (90 minutes) linear gradient of 5 to 50 % acetonitrile with 0.1 % acetic 
acid, at a flow rate of about 1 jil/minute. The flow was electrosprayed directly 
from the HPLC column into an ion trap mass spectrometer (LCQ, Finnigan). 

15 The mass spectrometry analysis was done in the positive ion mode, using 
repetitively a full MS scan usually between 450 to 1500 atomic mass units 
(amu) followed by collision-induced decomposition (CID) of the dominant ion 
selected from the previous MS scan. In some cases the full MS was performed 
with a narrower mass range to reduce the number of detected peptides. The 

20 peptides were identified by comparing their MS and CID data to the calculated 
MS and CID of the proteins in the Genpept databank 
(www.ncbi.nlm.nih.gov/genpept) using the Sequest software [25] (obtained 
from Finnigan, San Jose, CA). The number of times each peptide was 
fragmented by CID was usually limited to two by dynamic exclusion, a feature 

25 of the Xcalibur control software the LCQ mass spectrometer (Finnigan). 
Stabilization of cell surface HLA-2.1 by peptide binding: 
RMA-S-HHD cells were washed three times with PBS followed by 
incubation overnight in FCS-free IMDM medium at 26°C. Synthetic peptides 
were added to 10 6 cells at a concentration of 100 jaM. The cells were 

30 incubated for two hours at 26°C followed by two hours at 37°C. The 
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stabilization of the HHD MHC by the peptides binding was measured by 
FACS analysis on Becton Dickinson FACStar flow cytometer after decorating 
the cells with W6/32 mAb at 4°C for one hour and then 30 min incubation with 
anti-mouse FITC at 4°C (Sigma). 
5 Cytotoxic T lymphocytes assays: 

Transgenic mice expressing a single chain HLA-A2.1/Db~p2m which 
are double knockout for H-2Db and for p2m (HHD mice) [23] were 
immunized four times intra-peritoneally at 7-day interval with 2 x 10 6 
irradiated (5,000 rad) RMA-S-HHD-B7. 1 cells loaded for two hours at 26 °C 

io followed by two hours at 37 °C with 100 jiM of the synthetic peptides. Ten 
days after the last immunization the spleens were removed from the vaccinated 
mice. Splenocytes were re-stimulated with 100 |iM of synthetic peptides for 
five days. Viable lymphocytes were separated by lympholyte-M (Cadarlane, 
Hornby, Canada) and resuspended in RPMI-HEPES. Cytotoxic activity was 

15 measured as in [26] by admixing the lymphocytes at different ratios with 5 x 
10^ EL4-HHD cells grown in medium containing 35$ methionine and then 
loaded with the synthetic peptides. 

EXPERIMENTAL RESULTS 

20 In order to identify large number of MHC bound peptide antigens 

presented in the context of a particular MHC haplotype, different human cell 
lines were transfected with expression vectors for soluble, secreted MHC 
molecules. Indeed, different soluble MHC could be transfected into various 
cell lines resulting in enabling the recovery of large amounts of the soluble 

25 MHC molecules from the cell's growth medium. The sMHC molecules were 
recovered with their authentic patterns of peptides still bound and free of 
contamination by cellular debris and detergents. Prostate (PC3), ovarian 
(UCI-107) and breast (MDA-231 and MCF-7) cell lines were transfected with 
the DNA coding HLA-A2.1/Q10b or sMHC-A2, sMHC-B7 and sMHC-Cw4. 

30 Soluble MHC molecules were recovered from the culture medium without 



53 

disrupting the cells and the sMHC molecules were purified by immunoaffinity 
chromatography as shown in Figure 2A. About 200 ng of the sMHC 
molecules were recovered from about 10^ cells (Figures 1A-C and Figures 
2B-C). The MHC large subunit, the 02m and small amounts of antibodies that 
5 were released from the immunoaffinity column by the acid treatment were the 
only proteins detected in the column eluant. The peptides were separated from 
the proteins subunits of the MHC by ultra-filtration. The recovered heavy 
subunit of the soluble MHC molecules was confirmed to be that of HLA-A2.1 
by peptide mapping and by micro sequencing. 

io Sequencing of a large number of individual peptides was approached by 

electrospray tandem mass spectrometry. The peptides were partially resolved 
on homemade nano-capillary reversed phase columns interfaced directly to an 
electrospray mass spectrometer. The peptide mixtures were resolved by 
relatively long reversed-phase HPLC gradients on long capillary columns, 

15 enabling performing mass measurements and fragmentation of a large number 
of peptides. The mass spectra were recorded between 450 to 1500 mass units, 
which is the expected mass (m/z) range of the singly and the doubly charged 
MHC bound peptides. The mass spectrometry data included the 
total-ion-current chromatogram (TIC, Figure 3 A) and the mass spectrum of the 

20 peptides at each time point (Figure 3B). The mass spectrometer was 
programmed to repeatedly select the most abundant peptide observed in each 
spectrum and to fragment it by CID (Figure 3C). Peptides were identified by 
comparing their masses and the masses of their fragments to those calculated 
for peptides derived from all the human proteins in the databank. The 

25 computer programs were instructed to search for putative peptides resulting 
from non-specific proteolysis since the specificity of proteases responsible for 
generating the MHC bound peptides in cells is not well defined. 

The relatively high sensitivity of the capillary ESI-MS/MS analysis and 
the large amounts of peptides recovered from the cells by use of the soluble 

30 MHC, enabled to perform multiple capillary HPLC separations with each 
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peptide preparation. Peptides recovered from soluble MHC produced by about 
5x10? cells were used for each capillary chromatography. Multiple 
chromatography runs enabled to detect those peptides that were observed 
reproducibly and to combine their CID data to improve the signal-to-noise 
5 ratio of the CID spectra. The combined and improved data sets were used for 
databank searches and peptide identifications. Using relatively long capillary 
columns (of above 30 cm) and long reversed phase gradients facilitated 
achieving high resolving power. Most peptides elute normally during 15 to 30 
seconds, which was a sufficient time for the mass spectrometer to analyze up 

10 to three different co-eluting peptides. The mass spectrometer was 
programmed not to fragment any peptide more than twice in order to increase 
the total number of peptides analyzed during each chromatography. 

A total of about three thousands different peptides were sufficiently 
resolved and fragmented during the different chromatography runs of the 

15 mixtures eluted from the sHLA-A2 sHLA-B7 and sHLA-Cw4 recovered from 
the different cell lines. The large majority of the observed peptides were 
common to all the different cancer lines and only a small fraction was detected 
in only one of the cancer types. From this large number of detected peptides, 
about 200 were identified at high certainty to be derived from known proteins 

20 and the rest were not identified. Table 8 is typical list of such peptides 
recovered from the soluble MHCs and identified by the computer analysis. 
Among these peptides, fourteen were already known as MHC bound peptides. 
Those desired peptides that originate from putative tumor antigens were 
chemically synthesized to further evaluate the accuracy of their amino acid 

25 sequences and to enable to study them as MHC bound peptides and their 
significance as cancer antigens. Their amino acid sequence accuracy was 
ascertained by running a chromatography of the synthetic peptides using the 
exact conditions immediately after the natural peptides mixtures and 
comparing the chromatography retention times, the exact masses and the CID 

30 spectra of the synthetic and natural peptides (Figures 4A-B). When synthetic 
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peptides behaved identically to the natural peptides in these three criteria 
served as a clear indication that the identification was indeed correct. 
Twenty-seven of the most interesting peptides were chemically synthesized 
and confirmed to be correct by this assay, example of which is displayed in 
5 Figures 4A-B. 
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Table 8 

List of peptides recovered from sMHC of different cancer cells and identified 
at high certainty by mass spectrometry 



Peptides from soluble HLA-A2 





Mass 


Sequence 


Protein 


Pfisitirvn 1 


Score*"" 


Score** 


oyiiuicii^ 


rvcl 




(m/z) 


(SEQ ID NO:) 














1 


898.4 


1 »I-iL/ VII y y I J 


y IFN inducible protein (IP-30) 




1 J7.7 


OR 
Zo 




[41] 






T T T rY\n>TA A\f/')\ 

VAaAj Vr 1 AAV(2) 


y IFN inducible protein (IP-30) 


16-25 


1793.7 


31 




[41] 


3 


1210.4 


LLLD VPT AA VQ A(3 ) 


y IFN inducible protein (IP-30) 


16-27 


128.1 


21 




[41] 


4 


800.5 


GLLGTLVQ(4) 


Beta catenin 


400-407 


0.2 


17 


+ 




5 


913.4 


GLLGTLVQL(5) 


Beta catenin 


400-408 


181.7 


31 


+ 




6 


922.3 


ALFGALFLA(6) 


Phospholipid transfer protein 


2-10 


245.2 


23 


+ 




7 


945.4 


SLLGGDWSV(7) 


TSC-22-like protein 


22-32 


591.9 


34 


+ 




8 


947.4 


NLTISDVSV(8) 


MUC1 


130-138 


69.6 


23 


+ 


[26] 


9 


958.3 


SLWGQPAEA(9) 


Human collagen type IV 


18-25 


41.2 


23 






10 


981.7 


SLIGHLQTLO0) 


protein tyrosine posphatase 


336-344 


49.1 


32 


+ 




11 


989.5 


SLSEKTVLL(11) 


CD59 


106-114 


87.6 


29 


+ 




12 


989.4 


SLFPGKLEV(12) 


Flightless I homolog 


1010-18 


257.3 


30 


+ 




13 


1028.5 


GLIEKNIEL(13) 


DNA methyl transferase (MTDM) 


425^33 


87.6 


28 


+ 




14 


1031.4 


GLYPGLIWL(14) 


Interferon regulatory factor-6 


21-29 


864.8 


30 


+ 




15 


1038.5 


YLLPAIVHI(15) 


RNA helicase 


146-154 


408.4 


30 




[2] 


16 


1068.4 


ALSDHHIYL(16) 


Fructose bisphosphate aldolase 


216-224 


481.7 


23 


+ 


[21] 


17 


1071.5 


ILDQKJNEV(17) 


ornithine decarboxylase 


23-31 


108.8 


30 




[96] 


18 


1071.6 


ILDKKVEKV(18) 


Human HSP 90 beta, HSP 84 


570-578 


53.3 


29 




[74] 


19 


1080.4 


SLLPPTALVGL(19) 


H. Transporter SEC23 A 


156-164 


181.8 


33 






20 


1091.4 


GVYDGEEHSV(20) 


MAGE-B2 


231-240 


79.9 


20 


+ 




21 


1094.5 


SLLPPDALVGL(21) 


H. Transporter SEC23B 


150-160 


181.8 


33 






22 


1121.3 


TLWVDPYEV(22) 


B- cell translocation gene (BTG1) 


103-111 


577.3 


24 




[2] 


23 


1145.4 


FLFDGSPTYV(23) 


Fatty acid synthase (FAS) 


292-301 


26694 


23 


+ 




24 


1258.5 


FLFDGSPTYVL(24) 


Fatty acid synthase (FAS) 


292-302 


611.2 


27 


+ 




25 


1360.4 


ALWD1ETGQQTV(25 ) 


guanine nucleotide-binding 


167-178 


2366.8 


28 


+ 




Peptides from soluble HLA-B7 




Mass 


Sequence 


Protein 


Position 1 


Score 2 


Score- 5 


Synthetic 4 






(m/z) 


(SEQ ID NO:) 














i 


854.3 


VPSEPGGVL(26) 


70kDaSHP-lL 


422-30 


120 


27 


+ 




2 


883.4 


SPTQPIQL(27) 


cell membrane glycoprotein 


257-61 


80 


20 












110000 Mr 












3 


895.4 


SPALPGLKL(28) 


transmembrane activator and 


147-55 


120 


27 


+ 










CAML interactor 












4 


899.5 


APRTVALTA(29) 


HLA-SB beta 


9-17 


60 


24 




[75] 


5 


927.3 


SPKJLPVSSL(30) 


DNA binding protein homolog 


372-80 


120 


25 


+ 
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6 


989.3 


KPSLPFTSL(31) 


translation initiation codon 


79*87 


120 


28 


+ 




7 


999.5 


LVMAPRTVL(32) 


MHC class-I 


2-10 


135 


18 




[75] 


8 


1050.4 


KP AFF AEKL(3 3 ) 


annexin Al 


274-82 


80 


22 






9 


1075.4 


SPYQNIKIL(34) 


spermidine aminopropyltransferase 


128-36 


80 


20 






10 


1104.5 


AASKERSG VSL(3 5) 


Histone HI 


50-60 


36 


18 




[75] 


11 


1114.3 


APFEPLASGDL(36) 


precursor 


2-12 


240 


22 


+ 




12 


1194.5 


APSGSLAVPLAVL(37) 


hypothetical protein 


9-21 


360 


31 







Table 8: An example of MHC bound peptides that were identified by the 
Sequest software [25] (obtained from Finnigan, San Jose, CA) after mass 
spectrometer analysis. 1 Position of the first and the last amino acid of the 
5 peptide. 2c a i cu i a t e( i SCO re, estimating half the time for dissociation of the 
peptide-MHC complex [42], 3 Calculate score. 4 sequence approved by 
analyzing in comparison a synthetic peptide. 5 Peptide is known. 

Among the many peptides derived from different housekeeping proteins 
and enzymes, some peptides were determined to be derived from known tumor 

10 associated antigens. These include mucin (MUC-1) and MAGE-B2 while 
others were derived from proteins whose level is known to be significantly 
elevated in cancer cells such as beta-catenin, DNA methyl transferase and fatty 
acid synthase (Table 8). 

A comparison in the patterns of peptides presented by the same MHC in 

15 cell lines of different tissue origin enabled the identification of those peptide 
uniquely presented in only cells of a particular tissue origin. Only a few of the 
peptides were determined to be unique to specific cell lines while most of the 
peptides that were observed in all the different cell lines were derived from 
normal cellular proteins. Also, significantly different patterns of peptides were 

20 recovered from sHLA-A2 sHLA-B7 and from sHLA-Cw4. Examples for 
unique peptides which are displayed in Table 9 include peptide p922 
(phospholipid transfer protein) recovered only from PC3 cells and peptide 
p947 (SEQ ID NO: 8) (MUC1) recovered only from MCF-7 grown without 
estrogen. Peptide p945 (SEQ ID NO:7, derived from TSC-22-like protein) is a 

25 novel peptide that was detected at high level in the two-breast cancer cells 
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(MCF-7 and MDA-231), but was not observed in the ovarian (UCI-107) and 
the prostate (PC3) cancer cells. Peptide p981 (SEQ ID NO: 10) originated 
from protein tyrosine phosphatase, and was detected only in the breast cancer 
cell MDA-231. One of the most interesting novel peptides identified was 

5 pl091 (SEQ ID NO:20) derived from the tumor antigen MAGE-B2. The 
peptide was detected only in the ovarian cancer cells (UCI-107) and not in the 
other cell lines. The synthetic and natural peptides elution pattern and CID 
spectra of both were identical (Figures 5A and 5B). The binding affinity of 
this peptide to the MHC molecules was determined to be normal as assayed by 

10 reconstitution and stabilization of empty MHC on the surface of RMA-S-HHD 
cells (Figure 5C). This peptide is derived from the same region in the MAGE 
proteins, as do other previously identified MHC bound peptides derived from 
MAGE-A4 and from MAGE-A10 [27, 28] (Figure 5D). 

15 Table 9 

Comparison of MHC peptide patterns between cell lines of different cancer 

origin 



(A) 





Mass 


Sequence 


MCF-7 


1 MCF-7 


MDA- 


PC-3 


UCI- 


UCI- 




(m/z) 


(SEQ ID NO:) 




without 
estrogen 


231 




107 


101 


1 


898.4 


LLDVPTAAV(l) 


+ 


+ 


+ 


+ 


+ 


+ 


2 


1011.5 


LLLDVPTAAV(2) 


+ 


+ 


+ 




+ 


+ 


3 


1210.5 


LLLD VPT AAVQ A(3 ) 


+ 


+ 


+ 


+ 


+ 


+ 


4 


800.5 


GLLGTLVQ(4) 










+ 




5 


913.4 


GLLGTLVQL(5) 


+ 


+ 


+ 


+ 


+ 


+ 


6 


922.3 


ALFGALFLA(6) 








+ 






7 


945.4 


SLLGGDWSV(7) 


+ 


+ 


+ 








8 


947.4 


NLTISDVSV(8) 




+ 










9 


958.3 


SLWGQPAEA(9) 


+ 


+ 




+ 


+ 


+ 


10 


981.7 


SLIGHLQTU10) 






+ 








11 


989.5 


SLSEKTVLL(11) 


+ 


+ 


+ 




+ 


+ 


12 


989.4 


SLFPGKLEV(12) 


+ 


+ 




+ 


+ 


+ 


13 


1028.5 


GLEEKNIEL(13) 


+ 


+ 


+ 


+ 


+ 


+ 


14 


1031.4 


GLYPGLIWL(14) 


+ 


+ 


+ 


+ 






15 


1038.5 


YLLPAJVHI(15) 


+ 


+ 


+ 


+ 


+ 


+ 


16 


1068.4 


ALSDHHIYL( 1 6) 


+ 


+ 


+ 


+ 


+ 


+ 



17 


1071.5 


ILDQKINEV(17) 


- 


+ 


+ 


+ 


+ 


- 


18 


1071.6 


ILDKKVEKV(18) 


- 


+ 


+ 


+ 


+ 


+ 


19 


1080.4 


SLLPPTALVGL(19) 


- 


- 


+ 


- 


+ 


+ 


20 


1091.4 


GVYDGEEHSV(20) 










+ 




21 


1094.4 


SLLPPDALVGL(21) 


+ 


+ 


+ 




+ 


+ 


22 


1121.3 


TLWVDPYEV(22) 


+ 


+ 


+ 


+ 


+ 


+ 


23 


1145.4 


FLFDGSPTYV(23) 


+ 




+ 




+ 




24 


1258.5 


FLFDGSPTYVL(24) 


+ 


+ 


+ 




+ 


+ 


25 


1360.4 


ALWDIETGQQTV(25) 






+ 




+ 





(B) 





Mass 


Sequence 


C1R 


MDA-2 


UCI-10 




(m/z) 


(SEQ ID NO:) 




31 


7 


1 


854.3 


VPSEPGGVL(26) 


+ 






2 


883.4 


SPTQPIQL(27) 








3 


895.4 


SPALPGLKL(28) 


+ 






4 


899.4 


APRTVALTA(29) 








5 


999.5 


SPKLPVSSL(30) 


+ 


+ 




6 


927.3 


KPSLPFTSL(31) 


+ 




+ 


7 


989.3 


LVMAPRTVL(32) 








8 


1050.4 


KPAFFAEKL(33) 








9 


1075.4 


SPYQNDCIL(34) 




+ 




10 


1104.5 


AASKERSGVSL(35) 






+ 


11 


1114.3 


APFEPLASGIL(36) 


+ 


+ 


+ 


12 


1194.5 


APSGSLAVPLAVL(37) 




+ 





Table 9: (A) The appearance of peptides from soluble HLA-A2 in breast 
cancer cells MCF-7 and MDA-231, 1 MCF-7 that grown without estrogen, 
5 prostate cancer cell PC-3 and the ovarian cancer cells UCI-107 and UCI-10L 
(B) The appearance of peptides from soluble HLA-B7 in B cell leukemia 
cancer cells C1R, breast cancer cells MDA-231 and ovarian cancer cells 
UCI-107. 

io Another approach to ascertain that the identified peptides were indeed 

MHC bound peptide antigens, their capacity to bind tightly and stabilize cell 
surface HLA-A2.1 was tested by reconstitution into empty MHC on the 
surface of RMA-S-HHD cells. Binding was assayed by FACS analysis after 
decorating the cells with the fluorescent anti-intact MHC mAb W6/32 (Figure 

15 6). Nine of the synthetic peptides were determined to stabilize cell surface 
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MHC significantly more than without the added peptides and to a similar 
extent as peptide (G9-209-2M) IMDQVPFSV (SEQ ID NO:42), derived from 
the melanoma protein gp-100 [29]. 

To further evaluate the affinity of the peptides to the HLA-A2 and to 
5 obtain some insight into their immunogenic potential, selected peptides were 
tested for their ability to induce an immune response in HLA-A2 transgenic 
mice. It was assumed that only peptides that could be effectively presented 
and remain tightly bound to the cells would be capable of inducing an immune 
response in these mice. The same synthetic peptides that were used for the 

10 FACS analysis were used for immunization of the HHD transgenic mice, 
which express the human HLA-A2.1/Db- p2m single chain. To immunize the 
mice, the HHD culture cells were loaded with the different peptides and then 
injected to the HHD mice. The immune response in the mice was followed by 
the appearance of CTLs specific for these peptides. The lysis patterns of the 

is target HHD cells by T-cells taken from the immunized mice are shown in 
Figure 7. Some of the peptides were indeed capable of inducing an immune 
response, which both authenticate them as MHC bound peptides and gives an 
indication about their immunogenic potential. The CTL results demonstrate 
significant lysis of EL4-HHD cells loaded with the peptides pi 028 (SEQ ID 

20 NO: 13) from DNA methyl transferase, pl258 (SEQ ID NO:24) from fatty acid 
synthase, pi 121 (SEQ ID NO:22) from B cell translocation gene (BTG) and 
pi 068 (SEQ ID NO: 16) from aldolase as compared to the negative control 
peptide ALLCAPSLL (SEQ ID NO:43). 



25 SUMMARY OF PEPTIDE INFORMA TION FOR SOLUBLE HLA-A2 

The following provides a summary of peptide information so far 
collected for peptides bound to soluble HLA-A2, HLA-B7 and HLA-Cw4 
using the method of the present invention. 

The following notations are used herein: 



30 



10 



15 



20 



25 



30 
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G: group number until May 7 , 2001 

Mg: mass of the natural peptide 

Mp: mass of the identified peptide 

Tg: observed retention time of the natural peptide 

Tp: calculated retention time of the identified peptide 

S: calculated internal score 

A2:- adherence to HLA-A2 consensus motif 

B7 : adherence to HLA-B7 consensus motif 

Cw4 : adherence to HLA-Cw4 consensus motif 

P: identified peptide sequence 

PR: protein from which sequence is derived 

POS: location of peptide in protein 

genpept: link to protein information in GenBank 



ref : 


previously known peptide 


Cell lines: 




#D: 


PC3+A2/Q10 


#E: 


MCF7+A2/Q10 


#F: 


MDA-231+A2/Q10 


#EST: 


MCF7(with estrogen) +A2/Q10 


#FR: 


MCF7 (without estrogen) +A2/Q10 


#G: 


UCI-107+A2/Q10 


#H: 


ClR+sB7 


#1: 


UCI-107+sB7 


#J: 


MDA-231+SB7 


#K: 


UCI-101+SA2 


#L: 


2780 (ovarian cancer cell line)+sA2 


#M: 


MDA-231+sCw4 


#N: 


UCI-107+sCw4 


#0: 


526 (lung cancer cell line) +A2/Q10 


#S: 


synthetic peptides 



G=1990: Mg= 800.4: #S+(2,1) #G+ ( 10 , 7 ) S=83 ( 87 , 74 ) Mg= 800.5 Tg=38+-0 
35 Tp= 53 Mp=800.5 A2= 0.02/18 P=GLLGTLVQ 

genpept PR=>gi | 860988 | emb | CAA61107 . 1 | (X87838) beta-catenin 

[Homo sapiens] POS=399 (SEQ ID NO: 4) 

G=1234: Mg= 810.3: Tg=»31+-1 #D+(2,2) #E+(7,4) #F+(4,3) #EST+(7,2) 
40 #FR+(4,2) #G+(33,11) #K+(3 r 2) #L+(2 / l) 

S=84(87,79) Mp= 810.2 (-0.1) Tp= 34 A2= 11/29 P=ALAPGL PTA 
genpept PR=>gi | 57 71535 | gb I AAD514 19 . 1 | AF17 3937_1 (AF17 3937) secreted 
protein of unknown function [Homo sapiens] POS=21 (SEQ ID NO: 44) 
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G=1251: Mg= 811.4: Tg=35+-0 #G+(5,3) 

S=96(96,99) Mp= 811.5 ( 0.1) Tp= 36 A2-465/26 P=KLLEPVL 
genpept PR=>gi | 3384 4 7 j gb I AAA6058 3 . 1 | (M60854) RPS16 [Homo sapiens] 
5 POS=50 (SEQ ID NO: 45) 

G=1378: Mg= 841.3: Tg=41+-0 #G+(9,5) 

3=77(83, 66) Mp= 841.4 { 0.1) Tp= 40 A2=0.0/16 P=SLLPAIVE 
genpept PR=>gi | 18 9428 I gb I AAA36399 . 1 I (J02902) phosphatase 2A 
10 regulatory subunit [Homo sapiens] POS=403 (SEQ ID NO: 46) 

G=1419: Mg= 848.3: Tg=34+-1 #E+(2,1) #F+(2,1) #G+(5,5) #K+(1,1) 
S=84(83,89) Mp= 848.4 ( 0.1) Tp- 34 A2= 52/26 P=SVLGSLSSV 
genpept PR=>gi | 58 33114 | gb I AAD534 01 . 1 | AF107 8 4 0_1 (AF107 84 0) nuclear 
15 pore-associated protein [Homo sapiens] POS=280 (SEQ ID NO: 47) 

G-1420: Mg= 848.4: Tg=37+-1 #D+(2,2) #E+(11,6) #F+(8,5) #EST+(3,2) 
#FR+(4,2) #G+(25,11) 

S=95(94,99) Mp= 848.4 ( 0.0) Tp= 39 A2=118/28 P=LLGPPPVGV 
20 genpept PR->gi | 10436199 | dbj j BAB14 750 . 1 | (AK023978) unnamed protein 
product [Homo sapiens] POS=159 (SEQ ID NO: 48) 

G=1439: Mg= 852.3: Tg=22+-2 #FR+(2,1) #I+(32,6) 

S=83(81, 89) Mp= 852.0 (-0.3) Tp= 25 A2=0.0/l P=PGPPPPPPP 
25 genpept PR=>gi I 568 9367 | db j I BAA82 967 . 1 | (AB021227) membrane -type -5 
matrix metalloproteinase [Homo sapiens] POS=ll (SEQ ID NO: 49) 

G=1492: Mg= 860.3: Tg=37+-1 #G+(9,5) 

S=81(87,69) Mp= 860. 3( 0.0) Tp= 31 A2=116/28 P=SMSGPLIGV 
30 genpept PR=>gi | 14 6918 9 I dbj I BAA09482 . 1 | (D50923) The KIAA0133 gene 
product is novel. [Homo sapiens] POS=629 (SEQ ID NO: 50) 

G=1510: Mg= 862.5: Tg=29+-2 #F+(4,4) #G+(4,2) 

3=78(87,59) Mp= 862.2 (-0.3) Tp= 27 A2-116/32 P=SMAPGLTSV 
35 genpept PR=>gi | 124 84 55 9 | gb I AAF20366 . 2 | AF1507 54_1 (AF150754) 

3 'phosphoadenosine 5 1 -phosphosulf ate synthase 2b isoform [Homo 
sapiens] POS=542 (SEQ ID NO: 51) 

G=1540: Mg- 868.4: Tg=43+-0 #E+(2,1) #F+(7,4) #FR+(1,1) #G+(4,4) 
40 #K+(3,3) 

3=83(89, 69) Mp= 868. 4( 0.0) Tp= 46 A2= 19/30 P=LLIPGLATA 
genpept PR=>gi | 2274 974 | emb I CAA574 8 9 . 1 I (X81900) NADH oxidoreductase 
subunit MWFE [Homo sapiens] POS=16 (SEQ ID NO: 52) 
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G=1563: Mg= 871.3: Tg=36+-1 #D+(1,1) #E+(8,6) #F+(4,3) #EST+(3,2) 
#FR+ (5, 2) #G+ (15, 9) 

S=86(89,79) Mp= 871.4 ( 0.1) Tp= 33 A2==592/33 P=GLLGNVAEV 
5 genpept PR=>gi | 12655181 I gb | AAH014 4 7 . 1 | AAH014 4 7 (BC001447) Similar to 
ZYG homolog [Homo sapiens] POS=10 (SEQ ID NO: 53) 

G=1575: Mg= 872.4: Tg=32+-3 #D+(2 / l) #E+(3,3) #F+(24,7) #EST+(3,2) 
#FR+ (4, 2) #G+ (12, 7) 
10 S=79(76,88) Mp= 872.5 ( 0.1) Tp= 33 A2= 11/26 P=SLIKLVEA 
genpept PR=>gi | 7020538 ! dbj | BAA91170 . 1 I (AK000444) unnamed protein 
product [Homo sapiens] POS=277 (SEQ ID NO: 54) 

G=1606: Mg= 876.4: Tg=28+-2 #E+(2,1) #F+(2 / l) #EST+(1,1) #FR+(4,2) 
15 S=84(82,89) Mp= 876.3 (-0.1) Tp= 32 A2=201/31 P=GLAESVSTL 
genpept PR=>gi | 12 652733 | gb | AAH00116 . 1 | AAH00116 (BC000116) Similar to 
KIAA0174 gene product [Homo sapiens] POS=95 (SEQ ID NO: 55) 

G=1621: Mg= 878.3: Tg=40+-1 #E+(6,4) #F+(4,2) #EST+(1,1) #FR+(3,2) 
20 #G+(11,8) #K+(1,1) 

S=88(92,79) Mp= 878.4 ( 0. 1) Tp= 40 A2= 55/30 P-AIIGGTFTV 
genpept PR=>gi | 633024 3 I dbj I BAA8 64 95 . 1 | (AB033007) KIAA1181 protein 
[Homo sapiens] POS=304 (SEQ ID NO: 56) 

25 G-1637: Mg= 880.3: Tg=37+-0 #H+(5,3) 

S=88(89, 88) Mp= 880.4 ( 0.1) Tp= 40 A2= 2/23 P=IITGPAPVL 
genpept PR=>gi j 754 2357 | gb I AAF63417 . 1 | AF14 2 4 22_1 (AF14 2422) QUAKING 
isoform 3 [Homo sapiens] POS=250 (SEQ ID NO: 57) 

30 G=1655: Mg= 882.3: Tg=43+-1 #F+(1,1) #G+(6,4) #K+(1,1) 

S=87(87,88) Mp= 882. 3( 0.0) Tp= 43 A2=0.0/18 P=SFDGWATV 
genpept PR=>gi | 7263944 | emb | CAB81773 . 1 | (AJ276359) mucin 4 [Homo 
sapiens] POS=1560 (SEQ ID NO:58) 

35 G=1732: Mg= 894.4: Tg=42+-0 #G+(6,3) 

S=79(82, 72) Mp= 894.4 ( 0.0) Tp= 41 A2= 2/20 P=LPPDALVGL 

genpept PR=>gi | 12 9666 6 I emb | CAA65775 . 1 I (X97065) Sec23 protein [Homo 
sapiens] POS=158 (SEQ ID NO:59) 

40 G=1737: Mg= 895.3: Tg=14+-1 #G+(4,2) 

S-76(79, 72) Mp= 895.4 ( 0.1) Tp= 22 A2= 47/25 P=ILDAGGHNV 
genpept PR=>gi | 1808578 | dbj | BAA07918 . 1 | (D44466) proteasome subunit 
pll2 [Homo sapiens] POS=736 (SEQ ID NO: 60) 
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G=1744: Mg= 896.3: Tg=28+-3 #D+(6,3) #E+<14,5) #F+(20,6) 

#EST+(5,2) #FR+(6,3) #G+(47,11) #K+(8,4) 

3=92(98,81) Mp= 896. 4 ( 0.1) Tp= 30 A2-512/30 P=GLYSGVTTV 
5 genpept PR=>gi I 360 65 I emb I CAA42118 . 1 | (X59543) Ml subunit of 
ribonucleotide reductase [Homo sapiens] POS=46 (SEQ ID NO: 61) 



G=1745: Mg= 896.3: Tg=55+-0 #G+(8,4) 

3=79(76,86) Mp= 896.5 ( 0.2) Tp= 60 A2=>lk/24 P=FLYPFPL 

10 genpept PR=>gi | 4 36224 I dbj | BAA050 62 . 1 | (D26067) KIAA0033 [Homo 
sapiens] POS=185 (SEQ ID NO: 62) 



G=1768: Mg= 898.4: Tg=36+-1 #D+(11,3) #E+(50,10) #F+(13,7) 

#EST+(7,2) #FR+(13,3) #G+(63,11) #K+(15 f 6) #L+(9,3) 
15 S=81(78, 89) Mp= 898.4 ( 0.0) Tp= 36 A2= 47/28 P=LLDVPTAAV 
genpept PR=>gi | 6165 618 | gb | AAF04 618 . 1 | AF0 97 362_1 (AF097 3 62) 

gamma-interf eron inducible lysosomal thiol reductase [Homo sapiens] 
POS=27 (SEQ ID NO:l) ref 



20 G=1770: Mg= 898.4: Tg=38+-1 #E+(4,2) #F+(3,2) #FR+(1,1) #G+(18,11) 
#K+ (2, 1) 

S=88(92,79) Mp= 898. 3 (-0.1) Tp= 41 A2= 79/29 P=ALLPSSPTL 
genpept PR=>gi I 1737205 I gb I AAB38876 . 1 | (U75276) TFIIB related factor 
hBRF [Homo sapiens] POS=609 (SEQ ID NO: 63) 

25 

G=1786: Mg= 899.5: Tg=26-f-2 #F+(2,2) #EST+(4,2) #FR+(5,2) 

#G+(32,8) #K+(8,5) #L+(1,1) 

S=93(96,89) Mp= 899.5 ( 0.0) Tp= 27 A2=243/25 P=KLGSVPVTV 
genpept PR=>gi | 12 653 653 I gb I AAH00 609 . 1 1 AAH00 609 (BC000 60 9) KIAA0738 
30 gene product [Homo sapiens] POS=623 (SEQ ID NO: 64) 

G=1795: Mg= 900.4: Tg=53+-0 #D+(2,2) #E+(7,5) #F+(11,6) #EST+(2,1) 
#FR+(6,3) #G+(21,11) #K+(16,6) 

S=81(86,72) Mp= 900.5 ( 0.1) Tp- 55 A2 = 182/33 P=ALFPGVALL 

35 genpept PR=>gi | 224 5365 | gb I AAC51518 . 1 | (U75885) ER-60 protein [Homo 
sapiens] POS=7 (SEQ ID NO: 65) 



G=1802: Mg= 901.3: Tg=33+-2 #E+(3,2) #G+(2,1) 

S=85(92,69) Mp= 901.5 ( 0.2) Tp= 32 A2=160/29 P=GLVGSLQEV 
40 genpept PR=>gi | 11967711 | emb I CAC19484 . 1 | (AJ278357) Tsg24 protein 
[Homo sapiens] POS=56 (SEQ ID NO: 66) 



G=1804: Mg= 901.4: Tg=31+-2 #E+(1,1) #FR+(1,1) #G+(6,3) 
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S=90(95,79) Mp= 901.3 (-0.1) Tp= 22 A2= 2/18 P-APLSDTAQV 
genpept PR=>gi I 104 3878 9 I dbj | BAB1534 4 . 1 | (AK026063) unnamed protein 
product [Homo sapiens] POS=197 (SEQ ID NO: 67) 

5 G=1804: Mg= 901.4: Tg=31+-2 #E+(l f l) #FR+(1,1) #G+(6,3) 

S=89(94,79) Mp- 901.5 ( 0.1) Tp= 36 A2=160/33 P=SLASLLAKV 
genpept PR=>gi | 84 89831 | gb I AAF7 5772 . 1 | AF2 65555_1 (AF2 65555) 

ubiquitin-conjugating BIR-domain enzyme APOLLON [Homo sapiens] 
POS=1230 (SEQ ID NO: 68) 

10 

G=1822: Mg= 903.3: Tg=16+-7 #D+(7,3) #E+(14,4) #F+(14 / 4) 

#EST+(18,2) #FR+(13,3) #G+(116,10) #K+(20,6) 

S=92(98,79) Mp= 903.4 ( 0.1) Tp= 23 A2 = 160/29 P=GLAT DVQTV 
genpept PR=>gi | 56564 7 | dbj I BAA0564 5 . 1 I (D26598) proteasome subunit 
15 HsClO-II [Homo sapiens] POS=55 (SEQ ID NO: 69) 

G=1860: Mg= 907.5: Tg=39+-1 #D+(12,4) #E+(1,1) #F+(9,5) #EST+(3 f l) 
#FR+(4,2) #G+(10,6) #K+(4,3) 

S=88(91,81) Mp= 907.5 ( 0.0) Tp- 37 A2= 79/31 P=SLFGGSVKL 
20 genpept PR=>gi | 13375569 I gb | AAK20398 . 1 | AF34 9951_1 (AF349951) HP95 
[Homo sapiens] POS=103 (SEQ ID NO: 70) 

G=1861: Mg= 907.6: Tg=38+-1 #EST+(2,1) #FR+(1,1) 

S=87(87,89) Mp= 907. 6 ( 0.0) Tp= 32 A2= 21/19 P=KVGPVPVLV 
25 genpept PR=>gi | 12804 623 | gb | AAH01734 . 1 | AAH01734 (BC001734) protein 
translocation complex beta [Homo sapiens] POS=67 (SEQ ID NO: 71) 

G-1899: Mg- 910.3: Tg=46+-1 #E+(8,5) #F+(21 / 7) #EST+(4,2) 

#FR+(7,3) #G+(24,11) #K+(13,6) #L+(3,2) 
30 S=72(65,89) Mp= 910.4 ( 0.1) Tp= 40 A2 = 182/32 P=GLLPDVPSL 
genpept PR=>gi I 13623421 | gb I AAH06309 . 1 | AAH06309 (BC006309) Similar to 
RIKEN cDNA 5730589L02 gene [Homo sapiens] POS=141 (SEQ ID NO: 72) 

G=1901: Mg= 910.4: Tg=39+-0 #D+(2,1) #E+(5,2) #F+(9,5) #EST+(1,1) 
35 #FR+<1,1) 

S=80(90,59) Mp= 910.4 ( 0.0) Tp= 41 A2 = 160/30 P^ALPPVLTTV 
genpept PR=>gi I 3882183 I dbj | BAA34 4 51 . 1 I (AB018274) KIAA0731 protein 
[Homo sapiens] POS=131 (SEQ ID NO: 73) 



40 G=1904: Mg= 910.4: Tg=38+-1 #£+(2,1) #F+(3,2) #EST+(1,1) #FR+(3,2) 
#G+(3,3) #K+<2,1) 
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S=90(95,79) Mp= 910. 5{ 0.1) Tp= 32 A2= 52/24 P=GVLPNIQAV 
genpept PR=>gi | 72 64 004 | emb I CAB8165 6 . 1 | (AL04 9822) dJ160A22 . 4 

(histone H2A) [Homo sapiens] POS=107 (SEQ ID NO: 74) 

G=1922: Mg= 912.5: Tg=42+-1 #E+(5,4) #F+(2 / l) #FR+(1 / 1) #G+(3,2) 
#K+ (1, 1) 

S=78(83,69) Mp= 912.5 ( 0.0) Tp= 43 A2= 49/31 P=ALTPVVVTL 
genpept PR=>gi | 1317 7 7 39 | gb| AAH0364 4 . 1 | AAH0364 4 (BC00364 4) 

cyclinrdependent kinase 4 [Homo sapiens] POS=170 (SEQ ID NO: 75) 

G=1931: Mg= 913.4: Tg=34+-1 #E+<8,4) #F+(2,2) #EST+(3,2) #FR+(1,1) 
S=84(96,59) Mp= 913. 3 (-0.1) Tp= 29 A2- 70/27 P=ALN PAD I TV 
genpept PR->gi | 6634 42 1 | emb | CAB64 373 . 1 I (AJ238375) putative protein 
TH1 [Homo sapiens] POS=103 (SEQ ID NO: 76) 

G=1933: Mg= 913.4: Tg=49+-0 #S+(12 / 2) #D+(17,5) #E+(16,8) 

#F+(18,7) #EST+(2,1) #FR+(4,2) #G+(22,11) #H+(1,1) #K+<10,6) 
S=93(95,89) Mp= 913. 6( 0.2) Tp= 46 A2-182/31 P=GLLGTLVQL 
genpept PR=>gi | 38520 I emb | CAA794 97 . 1 | (Z19054) beta catenin [Homo 
sapiens] POS=400 (SEQ ID NO: 5) 

G=1939: Mg= 914.4: Tg=40+-0 #G+{4,3) 

S=82(79,89) Mp= 914.4 { 0.0) Tp= 42 A2=0.0/16 P=DAEGLALLL 

genpept PR=>gi | 1060907 | dbj I BAA11242.il (D78177) quinolinate 

phosphoribosyl transferase [Homo sapiens] POS=2 (SEQ ID NO: 77) 

G=1942: Mg= 914.5: Tg=16+-3 #F+(4,1) #G+(6,3) 

5-90(95,79) Mp= 914.4(-0.1) Tp= 27 A2=160/29 P-SLTGHISTV 

genpept PR=>gi | 2832296 I gb I AAD0 94 07 . 1 | (AF04 4 333) pleiotropic 
regulator 1 [Homo sapiens] POS=241 (SEQ ID NO:78) 

G-1948: Mg= 915.5: Tg=38+-0 #D+(2,1) |F+(11,7) 

S=91(97, 77) Mp= 915. 6 ( 0.1) Tp= 42 A2=0.5/15 P=VHVLTFTV 
genpept PR=>gi | 3242214 | emb | CAA0724 3 . 1 | (AJ006778) DRIM protein [Homo 
sapiens] POS=1898 (SEQ ID NO:79) 

G=1974: Mg= 918.3: Tg=36+-1 #F+(3,2) #G+(19,10) 

S=84(88,77) Mp= 918. 6 ( 0.3) Tp= 34 A2= 6/25 P=SLKYVPLV 
genpept PR=>gi | 10436278 | dbj | BAB14783 . 1 | (AK024024) unnamed protein 
product [Homo sapiens] POS=248 (SEQ ID NO: 70) 

G-1979: Mg= 918.6: Tg=53+-0 #E+(5,3) #F+(6,3) #EST+(1,1) #FR+(4,2) 
#G+<4,2) #K+(4,3) 
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5=81(84,74) Mp= 918.5(-0.1) Tp= 54 A2=0.8/16 P=LPYWGVAL 

qenpept PR=>gi I 7023639 I dbj I BAA92035 . 1 | (AK002014) unnamed protein 
product [Homo sapiens] POS=272 (SEQ ID NO: 71) 

5 G=1988: Mg= 920.3: Tg=32+-1 #E+<2,2) #F+(5,2) #FR+(1,1) #G+(35,11) 
#K+ (1, 1) 

3=89(90,89) Mp= 920. 3( 0.0) Tp= 27 A2= 31/24 P=SIYPSPTGV 
genpept PR=>gi | 3661 610 | gb I AAC6177 6 . 1 | (AF092565) splicing factor 
Prp8 [Homo sapiens] POS=1693 (SEQ ID NO: 72) 

10 

G=2008: Mg= 922.3: Tg=58+-1 #S+(9,1) #D+(17,5) #L+(8,2) 
S=79(80,77) Mp= 922.5 ( 0.3) Tp= 59 A2=245/22 P = AL FG AL FLA 
genpept PR=>gi | 2 6534 32 | db j | BAA23 64 7 . 1 | (AB005297) BAI 1 [Homo 
sapiens] POS=1163 (SEQ ID NO: 6) 

15 

G=2023: Mg= 924.2: Tg=15+-14 #F+(6,3) #FR+(1,1) 

S=83(86,79) Mp= 924.4 ( 0.2) Tp= 33 A2= 11/24 P=ALAS HL I EA 
genpept PR=>gi | 7212807 | gb | AAF4 04 70 . 1 | AF18 1263_1 (AF181263) EH domain 
containing 2 [Homo sapiens] POS=507 (SEQ ID NO: 73) 

20 

G-2027: Mg= 924.5: Tg=13+-1 #G+<3,2) 

S=83<85,79) Mp= 924.4 (-0.1) Tp= 20 A2= 75/24 P=KLGPAPKTL 
genpept PR=>gi | 408198 I gb | AAB27691 . 1 | (S64671) DNA-binding 

protein/plasminogen activator inhibitor-1 regulator [human, HeLa S3, 
25 Peptide Partial, 176 aa] [Homo sapiens] POS=133 (SEQ ID NO: 74) 

G=2029: Mg= 924.5: Tg=43+-1 #F+(1,1) #G+(17,8) 

S=93(91, 99) Mp= 924. 6 ( 0.1) Tp= 44 A2=>lk/27 P=KLLEPVLL 
genpept PR=>gi | 3384 47 | gb | AAA60583 . 1 | (M60854) RPS16 [Homo sapiens] 
30 POS-50 (SEQ ID NO: 75) 

G=2050: Mg= 926.5: Tg=14+-3 #F+(8,2) #EST+(1,1) #FR+(3,2) #G+(6,3) 
S=90(96,79) Mp= 926.4 (-0.1) Tp= 29 A2= 78/30 P=ALSGHLETV 
genpept PR=>gi 1 12314 197 | emb | CAB9934 2 . 1 | (AL139008) bA255A11.8 (novel 
35 protein similar to annexin A2 (ANXA2) (lipocortin II, calpactin I 
heavy chain, chromobindin 8, PAP-IV) ) [Homo sapiens] POS=90 (SEQ ID 
NO : 7 6 ) 

G=2068: Mg= 929.5: Tg=43+-1 #E+(2,2) #F+(20,7) #FR+(1,1) 

40 #G+(36,11) #K+(24,6) #L+(3,2) 

S=92(90,99) Mp= 929. 5 ( 0.0) Tp= 31 A2=173/25 P=SLLDKIIGA 
genpept PR=>gi | 11034 809 I gb I AAG270 93 . 1 |AF312393_1 (AF3123 93) 

leucine-zipper protein FKSG13 [Homo sapiens] POS=56 (SEQ ID NO: 77) 
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G=2071: Mg= 930.3: Tg=35+-1 #F+(4,3) #G+(13,7) #K+(3,2) 
S=91(97,77) Mp= 930.4 ( 0.1) Tp= 33 A2=257/33 P=GLLGAGGTVSV 
genpept PR=>gi I 114 93522 I gb I AAG35534 . 1 | AF130117_68 (AF130109) PR01512 
5 [Homo sapiens] POS=17 (SEQ ID NO: 78) 

G=2072: Mg= 930.4: Tg=53+-0 #D+(8,4) #E+(21,8) »F+(10,5) 

#EST+(7,2) #FR+(8,3) #G+(11,7) #K+(7,4) 

S=78(83,69) Mp= 930. 6{ 0.2) Tp= 53 A2=608/32 P=GLVPFLVSV 
10 genpept PR=>gi | 1354 3657 | gb I AAH05 97 8 . 1 | AAH05978 (BC005978) 

karyopherin alpha 2 (RAG cohort 1, importin alpha 1) [Homo sapiens] 
POS=377 ( SEQ ID NO:79)ref 



G-2095: Mg= 932.5: Tg=46+-0 #F+(13,7) #G+(1,1) #K+(2,1) 
15 S=72(74,69) Mp= 932 . 5 ( 0.0) Tp= 46 A2= 54/27 P=ILGLGYPSL 
genpept PR=>gi I 7339520 I emb I CAB82850 . 1 I (AJ250717) procathepsin E 
[Homo sapiens] POS=184 (SEQ ID NO: 80) 

G=2126: Mg= 936.3: Tg=39+-1 #E+(1,1) #F+(7,5) #G+(1,1) 
20 S=81(86,72) Mp= 936.4 ( 0.1) Tp= 40 A2=213/26 P=ALLAGSEYL 
genpept PR=>gi I 12 653123 I gb I AAH00328 . 1 | AAH00328 (BC000328) eukaryotic 
translation initiation factor 3, subunit 7 (zeta, 66/67kD) [Homo 
sapiens] POS=439 (SEQ ID NO:81) 

25 G=2146: Mg= 938.3: Tg=38+-0 #FR+(2,1) 

S=77(72,90) Mp= 938 . 5 ( 0.2) Tp= 34 A2=656/33 P=SLAELVHAV 

genpept PR=>gi | 4 0928 63 I gb I AAD04 8 12 . 1 I (AF033122) non-p53 regulated 

PA26-T1 nuclear protein [Homo sapiens] POS=254 (SEQ ID NO: 82) 



30 



G=2160: Mg= 940.4: Tg=58+-1 #E+(6,4) #F+(8,3) #G+(1,1) #K+(3,2) 



3=82(80, 88) Mp= 940. 6( 0.2) Tp= 53 A2= 8/15 
genpept PR=>gi | 181159 | gb I AAB59528 . 1 | (J03072) serine 
[Homo sapiens] POS=l (SEQ ID NO: 83) 



P=MQPILLLL 
protease B 



35 G=2176: Mg= 942.1: Tg=48+-0 #G+(5,3) 

S=74(74,77) Mp= 942. 5( 0.4) Tp= 48 A2= 2/12 P=GLFAPQFY 
genpept PR=>gi | 206237 1 | gb I AAB65850 . 1 I (U70730) SnoN2 [Homo sapiens] 
POS=274 (SEQ ID NO:84) 



40 G=2208: Mg= 944.5: Tg=59+-2 #S+(2,1) #L+(4,2) 

S=86(94,69) Mp= 944. 5( 0.0) Tp= 48 A2=577/25 P=ALWGQGTLV 
genpept PR=>gi | 773628 | gb | AAA88873 . 1 I (U21267) immunoglobulin mu 
heavy chain [Homo sapiens] POS=103 (SEQ ID NO: 85) 
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G=2213: Mg» 945.4: Tg=37+-1 #S+(12,1) #E+(60,10) #F+(6,5) 

#EST+(5,2) #FR+(11,3) #K+(1,1) 

S=87(97,66) Mp= 945.5 ( 0.1) Tp= 34 A2=592/34 P=SLLGGDWSV 
5 genpept PR=>gi | 5231131 | gb | AAD41085 . 1 | AF153603_1 (AF153603) TSC-22 
related protein [Homo sapiens] POS=27 (SEQ ID NO: 7) 

G-2231: Mg= 947.3: Tg=41+-1 #E+(4,2) #FR+(1,1) #G+(11,6) 
S=85(88,81) Mp= 947. 3( 0.0) Tp= 20 A2=0.0/18 P = D T E T AWN V 
10 genpept PR=>gi | 4883681 | gb | AAD31596 . 1 | AF057352_1 (AF057352) 

hepatocellular carcinoma autoantigen [Homo sapiens] POS=117 (SEQ ID 
NO:86) 

G=2233: Mg= 947.4: Tg=34+-1 #S+(9,1) #EST+(3,2) #FR+(4,2) 
15 S=90(90,90) Mp= 947.4 ( 0.0) Tp= 30 A2= 70/23 P=NLTISDVSV 
genpept PR=>gi | 54 1 680 | emb I CAA5 67 34 . 1 I (X80761) MUC1 [Homo sapiens] 
POS=133 (SEQ ID NO:8) ref 

G=2241: Mg= 948.3: Tg=58+-1 #E+(6,3) #F+(4,3) #EST+(1,1) #FR+(3,2) 
20 #K+(5,3) 

S=75(74, 79) Mp= 948.5 ( 0.2) Tp= 62 A2=203/21 P=ALL P I FFGA 
genpept PR=>gi | 13185197 | emb | CAC33282 . 1 | (AX083359) unnamed protein 
product [Homo sapiens] POS=4 3 (SEQ ID NO: 87) 

25 G=2270: Mg= 951.6: Tg=40+-1 #D+(11,4) #E+(49,10) #F+(2,1) 

#EST+(3,1) #FR+(7,3) #G+<11,6) #K+(4,2) #L+(4,2) 

S=88(93,79) Mp= 951.5(-0.1) Tp= 33 A2=191/22 P=AMVIFKSGV 
genpept PR=>gi | 392 952 9 I gb I AAC82 612 . 1 | (AF034 611) intrinsic 

factor-B12 receptor precursor; cubilin [Homo sapiens] POS=3371 (SEQ 
30 ID NO:88) 

G=2299: Mg= 954.4: Tg=50+-0 #D+(1,1) #£+(31,9) #F+(15,7) 

#EST+(5,2) #FR+(8,3) #G+(28,11) #K+(16,6) #L+<4,2) 

S = 81(89, 63) Mp= 954. 5 ( 0.1) Tp= 49 A2 = 182/34 P=SLLPAIVEL 
35 genpept PR=>gi I 36034 18 | gb | AAC63525 . 1 | (AF083439) protein phosphatase 
2A regulatory subunit A, beta isoform [Homo sapiens] POS=415 (SEQ 
ID NO:89)ref 

G=2329: Mg- 956.6: Tg=33+-2 #EST+(3,2) #FR+(5,3) 
40 S=81<83,79) Mp= 956. 5 (-0.1) Tp= 32 A2=736/32 P=YLGPHIASV 
genpept PR=>gi | 12052942 | emb I CAB6664 6 . 1 I (AL136711) hypothetical 
protein [Homo sapiens] POS=137 (SEQ ID NO: 89) 
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G=2344: Mg= 958.3: Tg=33+-2 #S+(6,2) #D+(8,2) #E+(100,8) 

#G+(34,11) #K+(4,2) 

S=96(96,99) Mp= 958. 3{ 0.0) Tp= 38 A2= 41/23 P=SLWGQPAEA 
genpept PR=>gi | 4 634 30 I gb I AAC27816 . 1 1 (U04520) type IV collagen alpha 
5 5 chain [Homo sapiens] POS=18 (SEQ ID NO: 9) 

G=2350: Mg= 959.3: Tg=46+-1 #G+(14,7) #K+(8,5) 

5=79(90,54) Mp= 959. 6 ( 0.3) Tp= 45 A2= 16/27 P=SLFPGQVVI 
genpept PR=>gi I 12654 999 I gb I AAH0134 7 . 1 | AAH0134 7 (BC001347) polymerase 
10 (DNA-directed) , alpha (70kD) [Homo sapiens] POS=295 ( SEQ ID NO: 90) 

G=2355: Mg= 959.5: Tg=38+-0 #F+<3,2) 

3=82(80,89) Mp= 959.5 ( 0.0) Tp= 38 A2=324/29 P=SLLEKSLGL 
genpept PR=>gi | 13529002 I gb I AAH05291 . 1 I AAH05291 (BC005291) eukaryotic 
15 translation elongation factor 1 epsilon 1 [Homo sapiens] POS=8 (SEQ 
ID NO: 91) 

G=2356: Mg= 959.5: Tg=30+-1 #D+(3,1) #E+(29,9) #F+(2,2) #EST+(7,2) 
#FR+(9,3) #G+(49,10) #K+(6,2) 
20 S=85(84,90) Mp= 959.5 ( 0.0) Tp= 27 A2=485/24 P=ILTDITKGV 
genpept PR=>gi | 181969 | gb I AAA50388 . 1 I (M19997) elongation factor 2 
[Homo sapiens] POS=161 (SEQ ID NO: 92) 

G=2372: Mg= 960.5: Tg=35+-1 #D+(2,1) #G+(5,5) 

25 S=78(73,90) Mp= 960.5 ( 0.0) Tp= 34 A2= 79/24 P=GLFQGKTPL 

genpept PR=>gi I 4 58 9 92 9 I dbj | BAA7 6931 . 1 I (AB024704) fls353 [Homo 
sapiens] POS=53 (SEQ ID NO: 93) 

G=2382: Mg= 962.3: Tg=47+-0 #G+(5,4) 
30 S=80(82,77) Mp= 962.5 ( 0.2) Tp= 11 A2=0.0/6 P=ESQLKKMV 
genpept PR=>gi | 12 8 03337 | gb I AAH024 87 . 1 | AAH024 87 (BC0024 87) tumor 
susceptibility gene 101 [Homo sapiens] POS=5 (SEQ ID NO: 94) 

G=2434: Mg= 967.3: Tg=54+-0 #G+(10,7) 
35 S=83(85, 79) Mp= 967.5 ( 0.2) Tp= 61 A2 = 139/19 P=FLYPFPLA 
genpept PR=>gi | 4 36224 | dbj | BAA050 62 . 1 | (D26067) KIAA0033 [Homo 
sapiens] POS=185 (SEQ ID NO: 95) 

G=2446: Mg= 968.4: Tg=20+-2 #F+(4,2) #G+(19 f 6) 
40 S=86(89,79) Mp= 968.4 ( 0.0) Tp= 29 A2= 78/29 P=ALTGHLEEV 
genpept PR=>gi | 34 388 I emb I CAA2 9338 . 1 I (X05908) lipocortin (AA 1-346) 
[Homo sapiens] POS=99 ( SEQ ID NO: 96) 
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G=2447: Mg= 968.4: Tg=42+-1 #D+(1,1) #E+<1,1) »F+(11,7) #FR+{1,1) 
#G+(27,11) #K+(6,3) #L+(3,2) 

S=87(87,90) Mp= 968.4 ( 0.0) Tp= 36 A2=>lk/33 P=SLLDPVPEV 

qenpept PR=>gi | 1504 020 I dbj | BAA13209 . 1 I (D86973) similar to Yeast 

translation activator GCN1 (P1:A48126) [Homo sapiens] POS=1406 (SEQ 
ID NO:97) 



G=2464: Mg= 969.5: Tg=47+-0 #E+(2,2) #F+(7,5) #G+(25,11) 
S=83(85,81) Mp= 969. 5( 0.0) Tp= 48 A2= 1/19 P=MAPQALLLL 
genpept PR=>gi | 1780998 I emb I CAA7 1531 . 1 ! (Y10520) HLA-C alpha chain 
(Cw*1701) [Homo sapiens] POS=4 (SEQ ID NO: 98) 

G=2489: Mg= 971.5: Tg=42+-0 #D+(9,4) #F+(10,6) 

S=88(91,81) Mp= 971.4 (-0.1) Tp= 42 A2= 1/23 P=FSNGYLASL 
genpept PR=>gi | 12 655065 | gb I AAH01382 . 1 I AAH01382 (BC001382) solute 
carrier family 29 (nucleoside transporters), member 1 [Homo sapiens] 
POS=405 (SEQ ID NO: 99) 



G-2495: Mg= 972.4: Tg=52+-1 #D+(25,5) #E+(19,9) #F+(24,7) 

#EST+(5,2) #FR+(8,3) #G+(43,11) #K+(32,6) #L+(4,2) 

S=91(96,81) Mp= 972. 5( 0.1) Tp= 41 A2=656/30 P=TLIEDILGV 
genpept PR=>gi | 1 1 12 14 97 | emb | CAC14 94 6 . 1 | (AL132825) dJ756N5.2 (novel 
protein ( DKFZp727M231 ) similar to Trp4-associated protein TAP1 
( ABCB2 ) ) [Homo sapiens] POS=209 (SEQ ID NO:100) 



G=2514: Mg= 973.4: Tg=33+-1 #F+(5,3) #G+(10,6) #K+(4,2) #L+(1,1) 
S=82(84,79) Mp= 973.4 ( 0.0) Tp= 31 A2=0.0/17 P=IAEAVRTTL 
genpept PR=>gi | 2559010 J gb j AAC9 6011 . 1 | (AF026292) chaperonin 

containing t-complex polypeptide 1, eta subunit; CCT-eta [Homo 
sapiens] POS=32 (SEQ ID NO: 101) 



G=2515: Mg= 973.5: Tg=34+-1 #EST+(5,2) 

S=80(80,81) Mp= 973.4 (-0.1) Tp= 31 A2=307/27 P=KLSELEAAL 
genpept PR=>gi [ 12314 174 | emb | CAC08001 . 1 I (AL137067) bA13B9.3 (novel 
protein similar to KRT8) [Homo sapiens] POS=368 (SEQ ID NO: 102) 



G-2522: Mg= 974.3: Tg=30+-2 #S+(7,1) #E+(3,3) #EST+(8,2) 
#FR+(11, 3) 

S=89(90,89) Mp= 974.5 ( 0.2) Tp= 25 A2= 6/21 P=SLSVKLEQA 

genpept PR=>gi | 37258 I emb I CAA4 4 8 1 9 . 1 I (X63105) Tpr [Homo sapiens] 
POS=453 (SEQ ID NO:104) 
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G=2527: Mg= 974.3: Tg=50+-0 #D+(1,1) #E+(15,7) #F+(9,4) #G+(22,10) 
#K+ (10,5) 

S=90(98,72) Mp= 974. 5 ( 0.2) Tp= 50 A2=413/26 P=MLLAALMIV 
genpept PR=>gi | 5802822 |gb|AAD51798 . 1 I AF1 64 614_2 (AF164614) envelope 
5 protein [Homo sapiens] POS=76 (SEQ ID NO: 105) 

G=2537: Mg= 974.5: Tg=53+-0 #F+(4,3) #G+(13,9) #K+(2,2) 
S=81(83,79) Mp= 974. 5( 0.0) Tp= 56 A2= 60/24 P=AILPTSIFL 
genpept PR=>gi I 23234 10 | gb | AAB6658 1 . 1 I (AF015913) SkblHs [Homo 
10 sapiens} POS=229 (SEQ ID NO: 106) 

G=2546: Mg= 975.4: Tg=38+-1 #E+<3,2) #F+(7,4) #G+(19,10) #K+(1,1) 
S=82(91,63) Mp= 975.4 ( 0.0) Tp= 32 A2= 8/27 P=AALPNVYEV 
genpept PR=>gi | 12652781 | gb | AAH00 1 4 2.1| AAH00142 (BC000142) 

15 minichromosome maintenance deficient (S . cerevisiae) 5 (cell 
division cycle 46) [Homo sapiens] POS=326 (SEQ ID NO: 107) 

G=2567: Mg= 977.5: Tg=22 + -3 #G+(9,5) 

S=84(82,90) Mp= 977.4 (-0.1) Tp= 24 A2=186/24 P=RMLPHAPGV 
20 genpept PR=>gi | 1 667394 | gb | AAC50814 . 1 | (U31814) transcriptional 
regulator homolog RPD3 [Homo sapiens] POS=372 (SEQ ID NO: 108) 

G=2610: Mg= 981.7: Tg=36+-0 #S #F+(3,2) 

S=79(80,79) Mp= 981.6(-0.1) Tp= 38 A2= 49/32 P=SLIGHLQTL 
25 genpept PR=>gi | 642013 | gb | AAB06261 . 1 | (U16996) protein tyrosine 
posphatase [Homo sapiens] POS=337 (SEQ ID NO: 10) 

G=2636: Mg= 984.5: Tg=61+-1 #D+(5,4) #E+(9,5) #F+<12,6) #FR+(5,2) 
#G+(2,1) #K+(12,5) #L+(1,1) 
30 3=85(91,72) Mp« 984.7 ( 0.2) Tp= 61 A2= 11/21 P=LMVLVALIL 
genpept PR=>gi | 12654925 I gb | AAH01309 . 1 | AAH01309 (BC001309) Unknown 
(protein for MGC:5508) [Homo sapiens] POS=19 (SEQ ID NO: 109) 

G=2641: Mg= 984.7: Tg=36+-0 #EST+(1,1) #FR+(2,1) 
35 S=78(77,81) Mp= 983.5 (-1.2) Tp= 35 A2-140/28 P=KI LPTLEAV 
genpept PR=>gi | 12 653227 | gb | AAH00382 . 1 | AAH00382 (BC000382) 

interleukin enhancer binding factor 2, 45kD [Homo sapiens] POS=127 
(SEQ ID NO:110) 

40 G=2649: Mg- 985.5: Tg=40+-1 #E+(5,3) #FR+(3,2) #G+(4,3) 

S=84(93,63) Mp= 985. 6( 0.1) Tp= 38 A2=>lk/33 P=ALLDRIVSV 
genpept PR=>gi | 1504 030 | dbj | BAA13214 . 1 I (D86978) similar to a 
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C.elegans protein encoded in cosmid K12D12 ( Z4 9069 ) [Homo sapiens] 
POS=1499 (SEQ ID NO: 111) 

G=2661: Mg= 986.6: Tg=35+-1 #E+(3,2) #F+(3,2) #EST+(3,2) #FR+(1,1) 
5 #G+(2,2) 

3=84(82,89) Mp= 986. 7 { 0.1) Tp= 35 A2=160/26 P=TLVYHVVGV 
genpept PR=>gi | 354 0219 | dbj | BAA32 662 . 1 | (D87686) KIAA0017 protein 
[Homo sapiens] POS=165 (SEQ ID NO: 112) 

10 G=2666: Mg= 987.4: Tg=32+-2 #D+(1,1) #E+(1,1) #F+(5,2) #G+(12 / 7) 

S=77(87,54) Mp= 987.5 ( 0.1) Tp= 33 A2=131/26 P=YLPPATQVV 
genpept PR=>gi | 1332514 6 | gb | AAH04 38 6 . 1 | AAH04 38 6 (BC004 38 6) KIAA0111 
gene product [Homo sapiens] POS=207 (SEQ ID NO: 113) 

15 G=2668: Mg= 987.4: Tg=14+-13 #F+(4,3) 

S=77(76,81) Mp- 987.3(-0.1) Tp= 26 A2=0.0/15 P= PMEALAEQV 
genpept PR=>gi | 38822 97 | dbj | BAA34508 . 1 | (AB018331) KIAA0788 protein 
[Homo sapiens] POS=569 (SEQ ID NO: 114) 

20 G=2671: Mg= 987.6: Tg=29+-1 #F+(4,3) #EST+(1,1) #FR+(3,2) 

#G+ (11, 5) 

S=74<83,54) Mp= 987.5 (-0.1) Tp= 33 A2=656/30 P=RLSEAIVTV 
genpept PR->gi I 71068 4 8 I gb I AAF3614 9 . 1 I AF1510 63_1 (AF1510 63) HSPC22 9 
[Homo sapiens] POS=137 (SEQ ID NO: 115) 

25 

G-2677: Mg= 988.3: Tg=13+-4 #E+<2,1) #F+(7,1) #EST+(4,1) #G+(21, 6) 
S=88(99,63) Mp= 988.4 { 0.1) Tp= 20 A2- 28/27 P=SLDQPTQTV 
genpept PR=>gi | 17 18197 | gb | AAD034 62 . 1 | (U46025) translation intiation 
factor eIF-3 pllO subunit [Homo sapiens] POS=834 (SEQ ID NO: 116) 

30 

G=2692: Mg= 989.4: Tg=41+-1 #S+(8,2) #D+(13,5) #E+(12,6) #F+(11,6) 
#EST+(4,2) #FR+(6,3) #G+(15,8) #K+(13,6) 

S=79(83,72) Mp= 989. 5 ( 0.1) Tp= 39 A2=257/30 P=SLFPGKLEV 
genpept PR=>gi I 440177 I gb I AAC03568 . 1 I (U01184) flightless-I homolog 
35 [Homo sapiens] POS=1009 (SEQ ID NO: 12) 

G=2693: Mg= 989.5: Tg=31+-2 #S+(15,2) #E+(7,3) #F+(13,5) 

#EST+(3,1) #G+(12 f 7) #K+(6,4) 

S=83(84,81) Mp= 989. 5 ( 0.0) Tp= 35 A2= 88/29 P=SLSEKTVLL 

40 genpept PR=>gi j 180151 | gb | AAA887 93 . 1 | (M84349) CD59 protein [Homo 
sapiens] POS=106 (SEQ ID NO: 11) 
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G=2729: Mg= 993.5: Tg=18+-4 #F+(2 / l) #EST+(4,2) #FR+(8,3) #G+(9,4) 
#K+(1, 1) 

5=92(97,81) Mp= 993. 6( 0.1) Tp= 22 A2=243/23 P=KLH G VN I NV 
qenpept PR=>gi | 12653083 I gb I AAH00307 . 1 | AAH00307 (BC000307) RNA 
5 binding motif protein 4 [Homo sapiens] POS=59 (SEQ ID NO: 117) 

G=2769: Mg= 999.5: Tg=35+-1 #H+(5,3) #I+(8,4) #J+<5,4) 
S=82(83,81) Mp= 999. 5 ( 0.0) Tp= 39 A2= 5/18 P=LVMAPRTVL 
genpept PR=>gi | 9738918 I gb f AAF978 47 . 1 | (AF129293) MHC class I antigen 
10 [Homo sapiens] POS=2 (SEQ ID NO: 118) 

G=2773: Mg= 999.6: Tg=45+-1 #D+(2,1) #E+(15,6) #F+(12,7) 

#EST+(3,1) #FR+(8,3) »G+(15,8) #K+(11,5) #L+(1,1) 

S=80(86,69) Mp= 999. 6( 0.0) Tp= 42 A2= 22/31 P=SIIGRLLEV 
15 genpept PR=>gi I 190516 | gb I AAA36508 . 1 1 (M63960) protein phosphatase-1 
[Homo sapiens] POS=ll (SEQ ID NO: 119) 

G=2785: Mg=1000.5: Tg=33+-1 #G+(14 / 6) #K+(2,2) 

3=77(77,79) Mp-1000.6( 0.1) Tp= 36 A2= 2/16 P=MAVALQLRV 
20 genpept PR=>gi I 1154 4742 | emb I CAC17582 . 1 | (AL121997) dJ104 3F6 . 1 . 1 
(Chediak-Higashi syndrome 1 (isoform 1)) [Homo sapiens] POS=2544 
(SEQ ID NO:120) 



G=2789: Mg-1000.6: Tg=26+-2 #F+(3,2) #EST+(2,2) #FR+(2,1) 

25 #G+(13,6) #K+(1,1) 

S=90(90,90) Mp=1000.4 (-0.2) Tp= 27 A2=656/30 P=GLNEEIARV 

genpept PR=>gi | 250187 3 I gb I AAB80726 . 1 I (AF0177 90) 

retinoblastoma-associated protein HEC [Homo sapiens] POS=330 (SEQ 
ID NO: 121) 

30 

G=2791: Mg=1001.3: Tg=40+-1 #F+(10,5) #G+(16,9) #K+(4,3) 
S=78(81,72) Mp=1001.6( 0.3) Tp= 19 A2=0.9/23 P= IMKVAQAKL 
genpept PR=>gi | 6941888 I gb I AAF32263 . 1 I AF170562_1 (AF170562) 

ubiquitin-specif ic processing protease [Homo sapiens] POS=875 (SEQ 
35 ID NO:122) 



G=2822: Mg=1004.2: Tg=27+-1 #G+(8,5) 

S=90(91,90) Mp=1004.4( 0.2) Tp= 30 A2= 88/25 P=TLSEVTNQL 

genpept PR=>gi | 12053045 | emb I CAB66698 . 1 I (AL136764 ) hypothetical 
40 protein [Homo sapiens] POS=484 (SEQ ID NO: 123) 



G=2829: Mg=1004.5: Tg=38+-1 #F+(3,2) #EST+(1,1) #FR+<2,1) #G+(9,6) 
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S = 91(92, 90) Mp=1004.6( 0.1) Tp= 37 A2=324/29 P-AL FEGKVQL 
qenpept PR=>gi | 104397 12 | dbj | BAB15550 . 1 | (AK026780) unnamed protein 
product [Homo sapiens] POS=442 (SEQ ID NO: 124) 

5 G=2833: Mg=1004.6: Tg=29+-0 #EST+(3,1) 

S=87(87,89) Mp=1004.6( 0.0) Tp= 31 A2= 32/28 P=GLKGRVFEV 
genpept PR=>gi I 854 17 9 I emb I CAA60827 . 1 | (X87373) ribosomal protein S3a 
[Homo sapiens] POS-61 (SEQ ID NO: 125) 

10 G=2835: Mg-1005.2: Tg=48+-0 #G+(3,3) 

S=84(83,89) Mp=1005.5( 0.3) Tp= 42 A2= 35/25 P=NIFPYPVGV 
genpept PR=>gi | 28224 60 | gb | AAC39565 . 1 | (AF030234) splicing factor 
Sipl [Homo sapiens] POS=912 (SEQ ID NO: 126) 

15 G=2872: Mg=1009.6: Tg=47+-1 #E+(2 f l) #EST+(5,2) #FR+(6,3) 

#K+ (15, 6) 

S=87(96,66) Mp=1009.7( 0.1) Tp= 52 A2= 3/18 P=LVSIVVAVPL 
genpept PR=>gi | 7023136 I dbj | BAA91851 . 1 I (AK001708) unnamed protein 
product [Homo sapiens] POS=23 (SEQ ID NO: 127) 

20 

G=2881: Mg=1010.5: Tg=28+-1 #G+(8,5) 

S=84(85,82) Mp=1010.5( 0.0) Tp= 20 A2=370/30 P=NMYGKWTV 
genpept PR->gi | 184 52 67 | gb | AAC51102 . 1 | (U56402) SUPT5H [Homo sapiens] 
POS=562 (SEQ ID NO: 128) 

25 

G=2891: Mg=1011.5: Tg=43+-1 #E+(9,5) #F+(1,1) #EST+(3,2) #FR+(6,3) 
#G+{13,7) #K+(5,3) #L+(3,2) 

S=79(78,82) Mp=1011.5( 0.0) Tp= 45 A2=>lk/31 P=LLLDVPTAAV 
genpept PR=>gi | 6165 618 | gb | AAF04 618 . 1 | AF097 362_1 (AF097 362) 

30 gamma-interf eron inducible lysosomal thiol reductase [Homo sapiens] 
POS=26 (SEQ ID NO:2) ref 

G=2918: Mg=1014.4: Tg=48+-0 #D+(1,1) #E+(16,8) #F+(ll f 7) 

#EST+(2,1) #FR+(3,2) #G+(19,10) 
35 S=88(97,68) Mp=1014.6( 0.2) Tp= 46 A2=160/32 P=SLINVGLISV 
genpept PR=>gi | 12 653413 | gb | AAH004 7 6 . 1 | AAH004 7 6 (BC00047 6) acidic 
protein rich in leucines [Homo sapiens] POS=48 (SEQ ID NO: 129) 

G=2928: Mg=1015.4: Tg=56+-0 #E+(26,8) #EST+(2,1) #FR+(5,3) 
40 S=92(97,81) Mp=1015.5( 0.1) Tp= 61 A2=666/30 P— ALLGTLWEI 
genpept PR=>gi | 2224595 I dbj I BAA20785 . 1 I (AB002325) KIAA0327 protein 
[Homo sapiens] POS=18 (SEQ ID NO: 130) 



w 
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G=2929: Mg=1015.4: Tg=41+-1 #E+(5,3) #EST+(4,2) #FR+(2,2) 

#G+(12 / 7) #K+(5,3) 

3=81(86,72) Mp=1015.5( 0.1) Tp= 39 A2 = 13/16 P=FQDPVPLTV 
genpept PR=>gi | 4 325107 | gb | AAD17 258 . 1 | (AF11904 2) transcriptional 
5 intermediary factor 1 alpha; TIFlalpha [Homo sapiens] POS=890 (SEQ 
ID NO:131) 

G=2947: Mg=1016.4: Tg=45+-1 #E+(3,3) #F+(8,5) #EST+(2,2) #FR+(3,1) 
#G+ (18, 10) #K+ (7, 4) 
10 S=82(95,54) Mp=1016.6( 0.2) Tp= 39 A2=512/28 P=GLYPNLIQV 
genpept PR=>gi | 4240269 | cibj | BAA7 4 913 . 1 1 (AB020697) KIAA0890 protein 
[Homo sapiens] POS=1022 (SEQ ID NO: 132) 

G=2965: Mg=1018.4: Tg=23+-4 #D+(3,1) #E+(2,2) #F+(2,2) #G+(25,8) 
15 3=94(96,90) Mp=1018.6{ 0.2) Tp= 19 A2= 79/26 P=VMDSKIVQV 
genpept PR=>gi | 913393 I gb| AAC60648 . 1 I (S75295) nucleoprotein 

interactor 1, NPI-l^SRPl homolog [human, cervical carcinoma HeLa 
cells, Peptide, 538 aa] [Homo sapiens] POS=434 (SEQ ID NO: 133) 

20 G=2976: Mg=1019.6: Tg=46+-0 #D+(5,2) #E+(6,1) #F+(2,2) #EST+(1,1) 
#FR+(2, 1) #G+(4,3) 

S=83(81,90) Mp=1019.6( 0.0) Tp= 40 A2-745/32 P=ALLDKLYAL 
genpept PR=>gi | 702334 1 | dbj | BAA91929 . 1 I (AK001830) unnamed protein 
product [Homo sapiens] POS=78 (SEQ ID NO: 134) 

25 

G=2985: Mg=1020.5: Tg=45+-0 #D+(5,3) #E+(3,2) #F+(4,3) #FR+(1,1) 
#G+ (7,5) 

S=99(99,99) Mp=1020.5( 0.0) Tp= 40 A2=298/27 P=NLASFIEQV 
genpept PR=>gi | 348907 | gb | AAA35672 . 1 | (L15428) 4a-carbinolamine 
30 dehydratase [Homo sapiens] POS=19 (SEQ ID NO: 135) 

G=2998: Mg=1022.4: Tg=44+-0 #G+(5,3) 

S=76(70,90) Mp-1022.4( 0.0) Tp= 43 A2=0.7/12 P=TLWVDPYE 

genpept PR=>gi | 1703501 | gb | AAB37580 . 1 | (U72649) BTG2 [Homo sapiens] 
35 POS=101 (SEQ ID NO: 136) 

G=3002: Mg=1022.5: Tg=45+-1 #S+(2,1) #D+(3,2) #G+(7,4) 
S=82(83,81) Mp=1022.5( 0.0) Tp= 42 A2=>lk/25 P=KIADFGWSV 
genpept PR=>gi | 3127068 I gb | AAC77369 . 1 | (AF059681) serine/threonine 
40 kinase 13 [Homo sapiens] POS=147 (SEQ ID NO: 137) 

G=3036: Mg=1025.5: Tg=37+-1 #S #D+(1,1) #F+(4,2) #EST+(1,1) 
#G+ (5, 3) 
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S=90(91,90) Mp=1025.6( 0.1) Tp= 36 A2= 89/28 P=SLLSHVEQL 
genpept PR=>gi | 5305429 | gb IAAD41647 . 1 |AF072933_1 (AF072933) Mad2-like 
protein [Homo sapiens] POS=114 (SEQ ID NO: 138) 

5 G=3041: Mg=1026.3: Tg=45+-0 #D+(7,3) #FR+(1,1) #G+(4,3) 

S=84(90,72) Mp=1025.6(-0.7) Tp= 38 A2=>lk/30 P=GLADKVYFL 
genpept PR=>gi | 1228049 | db j I BAA11423. 1 I (D78586) multifunctional 
protein CAD [Homo sapiens] POS=445 (SEQ ID NO: 139) 

10 G=3061: Mg=1028.5: Tg=35+-1 #S+(6,2) #D+(3,1) #E+(20,7) #F+(8,5) 
#EST+(5,2) #FR+(5,2) IG+(11,7) 

S=88(92,81) Mp=1028.5( 0.0) Tp= 32 A2 = 88/28 P=GLIEKNIEL 
genpept PR=>gi | 1632819 I emb I CAA45219 . 1 I (X63692) DNA 

(cytosine-5-) -methyltransf erase [Homo sapiens] POS=425 ( SEQ ID 
15 NO:13) 

G=3073: Mg=1029.5: Tg=51+-0 #D+(1,1) #FR+(2 / l) #G+(5,4) #K+(2,1) 
S=81(78,90) Mp=1029.6( 0.1) Tp= 35 A2=>lk/31 P=SLLDIIEKV 
genpept PR=>gi | 1063586 | gb |AAB4 1564 . 1 | (L48546) tuberin [Homo 
20 sapiens] POS=526 (SEQ ID NO: 140) 

G=3092: Mg=1031.4: Tg=61+-1 #S+(8,2) #D+(29,4) #E+(10,6) #F+(3,1) 
#EST+(2,1) #FR+(5,2) #H+(1,1) #K+(1,1) 

S=84(82,90) Mp=1031.6( 0.2) Tp= 64 A2=865/30 P=GLYPGLIWL 
25 genpept PR=>gi | 25 99385 I gb I AAB84 111 . 1 | (AF027292) interferon 

regulatory factor 6 [Homo sapiens] POS=21 ( SEQ ID NO: 14) 

G=3118: Mg=1034.4: Tg=60+-1 #D+(16,5) #E+(45,9) #F+(14,7) 

#EST+(2,1) #FR+(10,3) #G+(11,7) #K+(7,4) 
30 S=81(85,72) Mp-1034.6( 0.2) Tp= 66 A2= 32/21 P=FVFPGELLL 
genpept PR=>gi | 12 652 633 | gb | AAH00062 . 1 | AAH000 62 (BC0000 62) solute 
carrier family 1 (neutral amino acid transporter) , member 5 [Homo 
sapiens] POS=89 (SEQ ID NO: 141) 

35 G=3127: Mg=1036.3: Tg=36+-0 #F+(2,1) 

5=78(77,81) Mp=1036.6( 0.3) Tp= 35 A2=656/30 P=ALNELLQHV 
genpept PR=>gi | 6682361 | gb I AAF23322 . 1 1 AF177 1 98_1 (AF177198) talin 
[Homo sapiens] POS=777 (SEQ ID NO:142)ref 

40 G=3128: Mg=1036.3: Tg=36+-1 #G+(12,7) 

S=83(84,81) Mp=1036.5( 0.2) Tp= 29 A2=913/27 P=NLYEGQITV 
genpept PR=>gi | 1699038 j gb I AAC50967 . 1 | (U78735) ABC 3 [Homo sapiens] 
POS=555 (SEQ iD NO:143) 
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G=3142: Mg-1037.5: Tg=43+-1 #EST+(2,2) #FR+(5,2) #G+(1,1) 

#I+(11,5) #J+(5,3) 

S=86(89,79) Mp=1037.5( 0.0) Tp= 41 A2=0.1/15 P=FTKDFAPVI 
5 qenpept PR=>gi | 7022824 | dbj 1 BAA91736 . 1 | (AK001518) unnamed protein 
product [Homo sapiens] POS=77 (SEQ ID NO: 144) 

G=3144: Mg=1037.6: Tg=51+-1 #D+(7,3) #E+{29,6) #F+(11,7) 

#EST+(2,1) #FR+(4,2) #G+(12,7) #K+(3,2) #L+(1,1) 
10 S=87{86,90) Mp=1037.7( 0.1) Tp= 53 A2=>lk/31 P=KLLEPVLLL 
genpept PR=>gi | 3384 47 | gb | AAA60583 . 1 | (M60854) RPS16 [Homo sapiens] 
POS=50 {SEQ ID NO:145)ref 

G=3154: Mg=1038.5: Tg=48+-1 #D+(32,4) #E+(48,9) #F+(7,5) 

15 #EST+(6,2) #FR+(9,3) #G+(24,10) #K+(9,6) 

S=81(82,81) Mp=1038.7( 0.2) Tp= 47 A2=408/30 P=YLLPAIVHI 
genpept PR=>gi | 2832596 I emb I CAB09792 . 1 j (Z97056) dJ434P1.3 (DEAD/H 
(Asp-Glu-Ala-Asp/His) box polypeptide 17 (72kD) ) [Homo sapiens] 
POS=146 (SEQ ID NO: 15) ref 

20 

G=3183: Mg=1041.4: Tg=52+-0 #FR+(2,1) #G+(1,1) 

S=79(82,72) Mp=1041.6( 0.2) Tp= 52 A2=>lk/23 P=GLFAPQFYV 
genpept PR=>gi I 206237 1 | gb I AAB658 50 . 1 I (U70730) SnoN2 [Homo sapiens] 
POS=274 (SEQ ID NO: 146) 

25 

G=3191: Mg=1042.4: Tg=29+-1 #S+(2,1) #G+(12,6) 

S=87(90,81) Mp=1042.5( 0.1) Tp= 27 A2=805/27 P-LMVDHVTEV 
genpept PR=>gi | 9930612 | gb I AAG02115 . 1 | AF293025_1 (AF293025) steroid 
receptor RNA activator isoform 2 [Homo sapiens] POS=183 (SEQ ID 
30 NO: 147) 

G=3201: Mg=1043.5: Tg=58+-1 #E+(2,2) #F+(6,4) #EST+<1,1) #FR+{2,1) 
#G+(7,4) #K+(1,1) 

S=85(88,81) Mp=1043.7{ 0.2) Tp= 62 A2=408/27 P=FLLPILSQI 
35 genpept PR=>gi | 2580552 | gb | AAC51830 . 1 | (AF000983) dead box, X isoform 
[Homo sapiens] POS=234 (SEQ ID NO: 148) 

G-3213: Mg=1045.5: Tg=58+-0 #E+(1,1) #F+(2,2) #FR+(1,1) 
3 = 84(90,72) Mp=1044.5 (-1.0) Tp= 54 A2 = 0.3/18 P=FLIPLNITN 

40 genpept PR=>gi | 2224 611 I dbj | BAA20793 . 1 | (AB002333) KIAA0335 [Homo 
sapiens] POS=938 (SEQ ID NO: 149) 
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G=3219: Mg=1046.6: Tg=40+-1 #D+<2,2) #E+(1,1) #F+(2,1) #EST+(4,1) 
#FR+(5,2) #G+(1,1) #K+(3,3) 

S=85(83,90) Mp=1046.7( 0.1) Tp= 39 A2=243/30 P=NLLPKLHIV 
qenpept PR=>gi | 4 58 8524 I gb | AAD2 6136.1| AF1 0919 6_1 (AF109196) 

5 intracellular chloride channel p64Hl [Homo sapiens] POS=190 (SEQ ID 
NO: 150) 

G=3227: Mg=1047.6: Tg=44+-0 #D+(1,1) #E+(15,8) #F+(2,2) #EST+(5,2) 
#FR+(7,3) #G+<12,6) 
10 S=79(82,72) Mp=1047.6( 0.0) Tp= 50 A2=413/31 P=LLDRFLATV 
genpept PR=>gi | 12653303 I gb | AAH00420 . 1 I AAH00420 (BC000420) cyclin I 
[Homo sapiens] POS=72 (SEQ ID NO: 151) 

G=3240: Mg=1049.4: Tg=41+-1 #E+(3,2) #F+(2,2) #EST+(2,2) #FR+(3,3) 
15 S=77(79,74) Mp=1049.5( 0.1) Tp- 36 A2=294/29 'P=YLDPSVLSGV 
genpept PR=>gi I 5050 98 I dbj | BAA06683 . 1 | (D31885) KIAA0069 [Homo 
sapiens] POS=84 (SEQ ID NO: 152) 

G=3242: Mg=1049.5: Tg=44+-0 #F+(8,6) 
20 S=78(85,63) Mp=104 8 . 5 (-1 . 0 ) Tp= 44 A2=378/27 P=LLYPTEITV 

genpept PR=>gi | 220141 I dbj I BAA00845 . 1 I (D01038) VLA-3 alpha subunit 
[Homo sapiens] POS=798 (SEQ ID NO: 153) 

G=3257: Mg-1051.4: Tg=65+-1 #D+(2,2) #E+(9,4) #F+(9,4) #EST+(2,1) 
25 #FR+(6,3) 

S=88(88,90) Mp=1051.6( 0.2) Tp= 63 A2=>lk/26 P=NLGDFLIFL 
genpept PR=>gi | 14 69175 | dbj I BAA09475 . 1 I (D50916) The KIAA0126 gene is 
partially related to a yeast gene. [Homo sapiens] POS=638 (SEQ ID 
NO: 154) 

30 

G=3258: Mg=1051.4: Tg=54+-0 #D+(18 / 4) #E+(10,6) #F+(8,5) #G+(1,1) 
#K+ (3,2) 

3=79(85,66) Mp=1051.5( 0.1) Tp= 56 A2=>lk/30 P=GLYEGLTWL 
genpept PR=>gi | 178989 | gb | AAA90928 . 1 | (M57763) ADP-ribosylation 
35 factor [Homo sapiens] POS=161 (SEQ ID NO: 155) 

G=3270: Mg=1054.3: Tg=51+-0 #D+(5,3) #E+(19,8) #F+(12,7) 

#EST+(2,1) #FR+(5,2) 

S=96(96,99) Mp=1054.5( 0.2) Tp= 48 A2=437/19 P=SLFDLNFQA 
40 genpept PR=>gi | 189292 | gb | AAB60701 . 1 | (M81600) NAD ( P ) H : quinone 

oxireductase [Homo sapiens] POS=227 (SEQ ID NO: 156) 



G=3271: Mg=1054.3: Tg=55+-1 #K+(5,2) 
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S-80(77,90) Mp=1054.4( 0.1) Tp= 43 A2=0.0/8 P=MFSLEDSII 
qenpept PR=>gi | 80902 9 I emb I CAA57 993 . 1 I (X82676) tyrosine phosphatase 
[Homo sapiens] POS=833 (SEQ ID NO: 157) 

5 G=3279: Mg-1055.4: Tg=37+-1 #G+(6,4) 

5=76(74, 81) Mp=1055.3 (-0.1) Tp= 37 A2=122/19 P=AMWEHPITA 
genpept PR=>gi | 101 97 638 I gb | AAG14 955 . 1 1 AF1824 1 9_1 (AF182419) MDS018 
[Homo sapiens] POS=65 (SEQ ID NO: 158) 

10 G=3297: Mg=1057.5: Tg=17+-2 #G+(8,4) 

S=95(94,99) Mp=1057.6( 0.1) Tp= 31 A2=320/26 P=YLGRLAHEV 
genpept PR=>gi | 12 6534 85 | gb | AAH00514 . 1 | AAH00514 (BC000514 ) ribosomal 
protein L13a [Homo sapiens] POS=137 ( SEQ ID NO: 159) 

15 G=3309: Mg=1059.5: Tg=34+-0 #F+(3,3) #EST+(3,1) #FR+(2,1) 

S=82(84,79) Mp=1059.6( 0.1) Tp= 30 A2=482/24 P=GLIDHQTYL 
genpept PR=>gi | 1477 651 I gb I AAB05428 . 1 1 (U63610) plectin [Homo 
sapiens] POS=4188 (SEQ ID NO: 160) 

20 G=3325: Mg=1061.4: Tg=40+-1 #G+<6,3) 

S=85(87,81) Mp=1061.7( 0.3) Tp= 31 A2=523/26 P=AIQDKLFQV 
genpept PR=>gi I 1354 3970 I gb I AAH06123 . 1 1 AAH06123 (BC006123) Similar to 
RIKEN cDNA 0710001P09 gene [Homo sapiens] POS=96 (SEQ ID NO: 161) 

25 G-3329: Mg=1062.4: Tg=29+-0 #H+(6,3) 

S=89(89,89) Mp=1062.5( 0.1) Tp= 27 A2=0.0/9 P=IVKWDRDM 
genpept PR=>gi | 179318 | gb | AAA518 1 1 . 1 I (M17987) beta-2-microglobulin 
[Homo sapiens] POS=112 (SEQ ID NO: 162) 



30 G=3331: Mg=1062.5: Tg=33-f-l #F+(9,6) #EST+(1,1) #G+(18,10) 

#K+{4, 3) 

S=86(97,63) Mp=1062.6( 0.1) Tp= 30 A2= 6/20 P=RIIDVVYNA 
genpept PR=>gi | 36150 | emb | CAA4 7 67 0 . 1 | (X67247) ribosomal protein S8 
[Homo sapiens] POS=77 (SEQ ID NO: 163) 

35 

G=3342: Mg=1064.4: Tg=20+-4 #E+(1,1) #F+(6,4) #EST+(3,1) #G+(16,6) 
S=86(88,82) Mp=1064.6( 0.2) Tp= 19 A2=439/28 P=KI YEGQVEV 
genpept PR=>gi | 550013 | gb | AAA85654 . 1 I (U14966) ribosomal protein L5 
[Homo sapiens] POS=117 (SEQ ID NO: 164) 

40 

G=3364: Mg=1066.3: Tg=50+-0 #E+(3,2) #G+(18,10) #K+(9,5) #L+(1,1) 
S=81(85,72) Mp=1066.6( 0.3) Tp= 50 A2-736/26 P=FLPSYIIDV 
genpept PR=>gi | 104 5574 | gb I AAC50293 . 1 | (U37012) cleavage and 



1 
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polyadenylation specificity factor [Homo sapiens] POS=185 (SEQ ID 
NO: 165) 

G-3384: Mg=1068.4: Tg=29+-2 #S+(8,2) #D+(3,1) #E+(17,4) #F+(8,5) 
5 #EST+(5,2) #FR+(6,3) #G+(18,7) #K+(4,2) 

S=87(90,81) Mp=1068.6( 0.2) Tp= 29 A2=482/23 P=ALSDHHIYL 
genpept PR=>gi I 28597 | emb I CAA28861 . 1 I (X05236) aldolase A (AA 1-364) 
[Homo sapiens] POS=216 (SEQ ID NO: 16) ref 

10 G=3385: Mg-1069.3: Tg=37+-1 #F+(2,2) #G+(7,5) 

S=84(92,66) Mp=1069.3( 0.0) Tp= 25 A2=855/25 P=YMMPVNSEV 
genpept PR=>gi | 12667401 | gb I AAK0 1 4 2 6 . 1 j AF32 67 3 1_1 (AF326731) NUF2R 
[Homo sapiens] POS=65 (SEQ ID NO: 166) 

15 G=3406: Mg=1071.5: Tg=20+-3 #D+(1,1) #FR+(2,1) #G+(20,7) 

S=94(97,90) Mp=1071.6( 0.1) Tp= 24 A2=109/30 P-ILDQKINEV 
genpept PR=>gi | 338278 | gb I AAA60563 . 1 | (M31061) ornithine 

decarboxylase [Homo sapiens] POS=23 (SEQ ID NO: 17) ref 

20 G=3410: Mg=1071.6: Tg=8+-8 #D+(7,1) #F+(6,1) #FR+(6,1) #G+(25,5) 
#K+ (4, 3) 

S=94(96,90) Mp=1071.7( 0.1) Tp= 12 A2= 53/29 P=ILDKKVEKV 
genpept PR=>gi j 38678 6 I gb I AAA3602 6. 1 I (J04988) 90 kD heat shock 
protein [Homo sapiens] POS=570 (SEQ ID NO: 18) ref 



25 



30 



35 



G=3418: Mg=1073.6: Tg=5+-7 #F+(1,1) #G+(3,3) 

S=79(79,81) Mp=1072.5(-l.l) Tp= 5 A2=0.1/14 P=NKDLKMPKV 
genpept PR=>gi | 180857 8 | dbj I BAA07 918 . 1 I (D444 66) proteasome subunit 
pll2 [Homo sapiens] POS=792 (SEQ ID NO: 167) 

G=3424: Mg=1074.6: Tg=46+-0 #K+(6,4) 

S=90(90,90) Mp=1074.4 (-0.2) Tp= 31 A2=201/28 P=NLAEDIMRL 

genpept PR=>gi | 37852 | emb I CAA79613 . 1 I (Z19554) vimentin [Homo 
sapiens] POS=177 (SEQ ID NO: 168) 



G=3427: Mg=1075.4: Tg=46+-0 #D+(1,1) #F+(8,6) 

S=72(73,72) Mp=1075.6( 0.2) Tp= 44 A2=>lk/31 P=YLPELLQTV 
genpept PR=>gi | 12 6532 99 | gb | AAH004 18 . 1 I AAH004 18 (BC000418) 

ectodermal-neural cortex (with BTB-like domain) [Homo sapiens] 
40 POS=228 (SEQ ID NO: 169) 



G=3470: Mg=1080.4: Tg=62+-1 #D+(13,4) #E+(17,8) #F+{11,6) 

#EST+(2,1) #FR+(6,3) #G+(19,9) #K+(10,5) #L+(1,1) 
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S=82{82,82) Mp=1080.6( 0.2) Tp= 69 A2=>lk/27 P=FLYPFPLAL 

qenpept PR=>gi I 436224 | dbj I BAA05062 . 1 I (D26067) KIAA0033 [Homo 
sapiens] POS=185 (SEQ ID NO: 170) 

5 G=3472: Mg=1080.4: Tg=50+-0 #F+(13,7) #G+(25,11) #K+{10,5) 

S=76(87,52) Mp=1080.5( 0.1) Tp= 53 A2=182/33 P=SLLPPTALVGL 
genpept PR=>gi | 1296664 | emb j CAA65774 . 1 | (X97064) Sec23 protein [Homo 
sapiens] POS=156 { SEQ ID NO: 19) 

10 G=3476: Mg=1080.7: Tg=41+-1 #FR+(1,1) #G+(5,4) 

5=75(77,72) Mp=1080 . 6 ( -0 . 1 ) Tp= 38 A2=>lk/29 P=NLYPFVKTV 

genpept PR=>gi | 12 631 9 6 | gb | AAA97 4 05 . 1 | (U37436) A I CAR 

formyltransferase/IMP cyclohydrolase bifunctional enzyme [Homo 
sapiens] POS=101 (SEQ ID NO: 171) 

15 

G=3477: Mg-1081.4: Tg-56+-0 #F+(6,3) 

S=90(87,99) Mp=1081.7( 0.3) Tp= 57 A2=>lk/24 P=SVIEQLFFV 
genpept PR=>gi | 3014 0 | emb I CAA34277 . 1 | (X16155) COUP-TF [Homo sapiens] 
POS=378 (SEQ ID NO: 172) 

20 

G=3478: Mg=1081.4: Tg=56+-0 #G+(4,3) 

S=84(86,81) Mp=1080.6(-0.8) Tp= 57 A2=>lk/29 P=SLLEPFVYL 

genpept PR=>gi I 7008404 | gb I AAF34 999 . 1 I (AF229840) kappa B-ras 2 [Homo 
sapiens] POS=156 (SEQ ID NO: 173) 

25 

G=3497: Mg=1084.7: Tg=24+-3 #F+(1,1) #EST+(3,1) #FR+(2,1) 
S=79<82,72) Mp=1084 . 6 (-0. 1) Tp= 37 A2=437/26 P=ILFGHENRV 

genpept PR=>gi | 5911941 | emb I CAB55946. 1 1 (AL117471) hypothetical 
protein [Homo sapiens] POS=250 (SEQ ID NO:174)ref 

30 

G=3505: Mg=1086.5: Tg=19+-3 #G+(18,6) 

S=88(91,83) Mp=1086.7( 0.2) Tp= 20 A2=998/29 P=KLQEVGQVSV 
genpept PR=>gi j 340307 | gb I AAA36808 . 1 I (M14648) vitronectin alpha 
subunit precursor [Homo sapiens] POS=338 (SEQ ID NO: 175) 

35 

G-3520: Mg=1088.6: Tg=37+-1 #F+(3,2) #EST+(2,1) #G+(8,5) #K+(5,4) 
S=72(82,49) Mp=1088 . 5 (-0. 1) Tp= 37 A2= 75/16 P=RLFDEPQLA 
genpept PR=>gi | 3334 982 | gb|AAC26984 . 1 IAAC2698 4 (AC005306) R27216_l 
[Homo sapiens] POS=2 (SEQ ID NO: 176) 

40 

G=3521: Mg=1088.6: Tg=44+-1 #G+(5,4) 



83 

S=70(71,68) Mp=1088.6( 0.0) Tp= 43 A2-119/30 P=SLFPGKLEVV 
genpept PR=>gi | 440177 | gb I AAC03568 . 1 I (U01184) flightless-I homolog 
[Homo sapiens] POS=1009 (SEQ ID NO: 177) 

5 G=3526: Mg=1089.6: Tg=50+-1 #S+(10,1) 

S=84(91,68) Mp=1089.6( 0.0) Tp= 57 A2- 37/23 P-VMLGTPFLVI 
genpept PR=>gi | 4 58 9536 | db j | BAA7 67 90 . 1 I (AB023163) KIAA0946 protein 
[Homo sapiens] POS=340 (SEQ ID NO: 178) 

10 G=3533: Mg=1091.4: Tg=15+-2 #G+(12,4) 

S=86(95,68) Mp=1091.4( 0.0) Tp= 13 A2= 80/20 P=GVYDGEEHSV 
genpept PR=>gi I 4 10274 9 | gb I AAD01565 . 1 [ (AF015766) MAGE XP-2 protein 
[Homo sapiens] POS=231 ( SEQ ID NO: 20) 

15 G=3545: Mg=1094.4: Tg=50+-1 #E+(22,9) #F+(16,7) #EST+(4,2) 

#FR+(4,2) #G+(50,11) #K+(11,6) #L+(2,2) 

S=80(89,59) Mp=1094.5( 0.1) Tp= 49 A2=182/33 P=SLLPPDALVGL 
genpept PR=>gi | 135292 99 | gb | AAH054 0 4 . 1 | AAH054 04 (BC0054 04) Unknown 
(protein for MGC:5020) [Homo sapiens] POS=156 (SEQ ID NO: 21) 

20 

G=3563: Mg=1098.3: Tg=38+-1 #D+(7,3) #E+(10,6) #F+(5,4) #EST+(3,2) 
#FR+ (4, 2) #G+ (12, 8) 

S=88(88,90) Mp-1098.4( 0.1) Tp= 30 A2=280/26 P=SLYDYNPNL 
genpept PR=>gi | 3337 38 3 I gb | AAC27426 . 1 ! (AC002 54 4) Translation 

25 initiation factor elF-pllO [Homo sapiens] POS=381 (SEQ ID NO: 179) 

G=3566: Mg=1098.6: Tg=50+-0 #E+(6,3) #EST+(3,1) #FR+(2 / l) 
S=82(89,68) Mp=1098.7( 0.1) Tp= 62 A2=194/25 P=FLLGPRLVLA 
genpept PR=>gi | 887368 | gb | AAC42003 . 1 | (L40397) ORF; putative [Homo 
30 sapiens] POS-31 (SEQ ID NO: 180) 

G=3579: Mg-1101.4: Tg=34+-1 #F+(5,4) #G+(2,2) 

S=88(91,83) Mp=1101.4( 0.0) Tp= 35 A2=502/24 P=FLYTGEGDTV 
genpept PR=>gi ! 1184 320 I gb I AAC50373 . 1 | (U45880) X-linked inhibitor of 
35 apotosis protein [Homo sapiens] POS=52 (SEQ ID NO: 181) 

G=3588: Mg=1102.6: Tg=34+-1 #E+(1,1) #EST+(4,2) #FR+(2,2) 
S=82(83,81) Mp=1102.6( 0.0) Tp= 27 A2=>lk/26 P=KLNPQQFEV 
genpept PR=>gi | 624704 | gb IAAB05994 . 1 | (L38961) putative transmembrane 
40 protein precursor [Homo sapiens] POS=289 (SEQ ID NO: 182) 

G=3596: Mg=1103.4: Tg=28+-2 #F+(3,2) #G+(1,1) 
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S=88(94,74) Mp=1103.4( 0.0) Tp= 23 A2=140/27 P=SLADLQNDEV 
qenpept PR=>gi | 854 17 9 I emb I CAA60827 . 1 | (X87373) ribosomal protein S3a 
[Homo sapiens] POS=70 (SEQ ID NO: 183) 

5 G=3603: Mg=1104.7: Tg=45+-0 #D+(1,1) #F+(4,3) #EST+(3,1) #FR+(5,2) 
S=85(99,54) Mp=1104.7( 0.0) Tp= 40 A2=364/28 P=RLLDYVVNI 
genpept PR=>gi | 70237 68 I db j | BAA9208 1 . 1 | (AK002094) unnamed protein 
product [Homo sapiens] POS=172 (SEQ ID NO: 184) 

10 G=3629: Mg=1113.5: Tg=35+-1 #G+(10,6) #K+(10,5) 

S=77(76,81) Mp=1113.6( 0.1) Tp= 31 A2= 46/21 P=FVDDYTVRV 
genpept PR=>gi | 1923256 I gb I AAC51866 . 1 I (U86782) 26S 

proteasome-associated padl homolog [Homo sapiens] POS=61 (SEQ ID 
NO: 185) 

15 

G=3637: Mg-1115.4: Tg=55+-1 #E+(29,8) #F+(14,7) #EST+(2,1) 

#FR+(4,2) #G+(9,5) #L+(1,1) 

S=82(90,66) Mp=1115.5( 0.1) Tp= 61 A2->lk/29 P=SLFEGTWYL 
genpept PR=>gi | 12 653065 | gb| AAH002 97 . 1 | AAH00297 (BC000297) 

20 3-hydroxy-3-methylglutaryl-Coenzyme A synthase 1 (soluble) [Homo 
sapiens] POS=447 (SEQ ID NO: 186) 

G=3652: Mg=1119.5: Tg=56+-0 #D+(4,3) 

S=80(84,72) Mp=1119.7( 0.2) Tp- 57 A2=512/27 P=ALYNWLIQV 

25 genpept PR=>gi | 32884 47 | emb I CAA07553 . 1 | (AJ007558) nucleoporin 155 
[Homo sapiens] POS=1038 (SEQ ID NO: 187) 

G=3653: Mg=1119.6: Tg=30+-1 #D+(1,1) #F+(3,2) #EST+(1,1) #FR+(2,1) 
#G+ (10, 5) 

30 S=80(84,72) Mp=1119.7( 0.1) Tp= 30 A2= 97/25 P=VLIDYQRNV 
genpept PR=>gi | 2626840 | dbj | BAA234 15 . 1 | (D89729) CRM1 protein [Homo 
sapiens] POS=784 (SEQ ID NO: 188) 

G=3658: Mg=1121.3: Tg=49+-0 #S+(9,2) #D+(7,3) #E+(8,5) #F+(14,7) 
35 #G+(26,11) #K+(2,2) #L+(1,1) 

S=84(81,91) Mp=1121.5( 0.2) Tp= 47 A2=577/24 P=TLWVDPYEV 
genpept PR=>gi | 1703501 | gb | AAB37 5 8 0 . 1 I (U72649) BTG2 [Homo sapiens] 
POS=101 (SEQ ID NO:22) ref 

40 G=3683: Mg=1128.3: Tg=51+-0 #S #G+(17,9) #K+(2,1) 

S=86(88,82) Mp-1128.5( 0.2) Tp= 55 A2=348/25 P=FTWEGLYNV 

genpept PR=>gi | 1276912 | gb | AAC504 50 . 1 I (U44839) UHX1 protein [Homo 
sapiens] POS=353 (SEQ ID NO: 189) 
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G=3694: Mg=1133.6: Tg=25+-3 #D+(2,1) #F+(1,1) #G+(7,3) 
5=85(87,81) Mp=1133.7< 0.1) Tp= 30 A2=>lk/32 P=ILMEHIHKL 
qenpept PR=>gi | 2 984 8 6 | gb | AAB25672 . 1 | (S56985) ribosomal protein L19 
5 [human, breast cancer cell line, MCF-7, Peptide, 196 aa] [Homo 
sapiens] POS=137 (SEQ ID NO: 190) ref 

G=3697: Mg=1134.6: Tg=42+-1 #E+(12,6) #F+(1,1) #EST+(3,1) 

#FR+(3,1) #G+(9,5) #K+(13,6) #L+(1,1) 
10 S=81(93,53) Mp=1134.6( 0.0) Tp= 37 A2=193/26 P=RLDELGGVYL 
genpept PR=>gi | 13374901 I emb | CAC34 517 . 1 | (AL031659) dJ343K2.2.3 
(ribophorin II (isoform 3)) [Homo sapiens] POS=185 (SEQ ID NO:191) 

G=3711: Mg=1140.6: Tg=40+-1 #EST+(1,1) #FR+(1,1) #G+(1,1) 
15 S=89(93,82) Mp=1140.7( 0.1) Tp= 40 A2=526/27 P=KLLSKFYEL 
genpept PR->gi | 104 39903 j dbj I BAB15591 . 1 I (AK026930) unnamed protein 
product [Homo sapiens] POS=231 (SEQ ID NO: 192) 

G=3721: Mg=1145.4: Tg=49+-1 #S+(7,1) #F+(2,2) #G+(14,10) 
20 S=79(83,70) Mp=1145.5( 0.1) Tp= 50 A2=>lk/23 P=FLFDGSPTYV 
genpept PR=>gi | 104 9053 I gb | AAC50259 . 1 | (U26644) encodes region of 
fatty acid synthase activity; FAS; multifunctional protein [Homo 
sapiens] POS=2329 (SEQ ID NO:23) 

25 G=3728: Mg=1147.5: Tg=48+-1 #E+(3,2) #EST+(2,1) #FR+(5,2) #G+(4,3) 
#K+ (13, 6) 

S=91(92,90) Mp=1147.7( 0.2) Tp= 45 A2=>lk/20 P=KVLDFEHFL 

genpept PR=>gi | 189022 | gb I AAA36348 . 1 I (M22920) smooth muscle mysoin 
light chain [Homo sapiens] POS=28 (SEQ ID NO: 193) 

30 

G=3743: Mg=1152.6: Tg=47+-0 #D+(5,3) #F+(1,1) 

S=79(82,72) Mp=1151.6(-1.0) Tp= 43 A2=>lk/24 P=YLPEDFIRV 

genpept PR=>gi | 2 6538 7 7 | gb I AAB87 669 . 1 I (AF02 627 3) interleukin-1 
receptor-associated kinase-2; IRAK-2 [Homo sapiens] POS=381 (SEQ ID 
35 NO:194) 

G=3754: Mg=1156.5: Tg=35+-1 #G+{3,2) #K+(8,5) 

S=91(95,83) Mp=1156.5( 0.0) Tp= 43 A2=403/28 P=FLSEHPNVTL 
genpept PR=>gi | 51028 31 | emb | CAB45270 . 1 | (AL022 318) bK150C2.2 

40 (Phorbolin 3) [Homo sapiens] POS=107 (SEQ ID NO: 195) 

G=3806: Mg=1210.4: Tg=42+-1 #E+(7,4) #EST+(1 / 1) #FR+(5,3) 

#G+(20,11) #K+(10 f 6) #L+(4,2) 
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S=76(80,68) Mp=1210.6( 0.2) Tp= 44 A2=128/21 P=LLLDVPTAAVQA 
genpept PR=>gi | 6165618 | gb| AAF04618 . 1 | AF097362_1 (AF097362) 

gamma-interf eron inducible lysosomal thiol reductase [Homo sapiens] 
POS=26 (SEQ ID NO: 3) ref 

5 

G=3831: Mg=1258.5: Tg=54+-1 #S+(12,2) #E+(12,6) #F+(12,6) 

#EST+(1,1) #FR+(7,3) #G+(20,10) #H+(1,1) #K+(10,5) 

S=87(96,68) Mp=1258.6( 0.1) Tp= 58 A2=611/27 P=FLFDGSPTYVL 
genpept PR=>gi I 104 9053 I gb | AAC50259 . 1 1 (U26644) encodes region of 
10 fatty acid synthase activity; FAS; multifunctional protein [Homo 
sapiens] POS=2329 (SEQ ID NO:24) 

G=3859: Mg=1360.4: Tg=44+-1 #E+(3,2) #G+(19,10) 

S=91(99,75) Mp=1360.6( 0.2) Tp= 42 A2=>lk/28 P=ALWDIETGQQTV 
15 genpept PR=>gi I 306785 | gb I AAA35922 . 1 | (M16538) G protein beta subunit 
[Homo sapiens] POS=167 {SEQ ID NO: 25) 

G=1451: Mg= 854.3: Tg=28+-1 #S+(4,1) #H+(5,3) 

S=84(78,99) Mp=854.3 Tp=29 B7=120/27 P=VPSEPGGVL 

20 genpept PR=>gi | 17324 1 9 | gb | AAB51323 . 1 | (U47924) protein tyrosine 
phosphatase 1C [Homo sapiens] POS=422 (SEQ ID NO: 26) 

G=1637: Mg- 880.3: Tg=37+-0 #H+(5,3) 

S=88{89,88) Mp= 880.4 ( 0.1) Tp= 40 A2= 2/23 P=IITGPAPVL 
25 genpept PR=>gi | 7 54 2357 | gb | AAF634 17 . 1 | AF142422_1 (AF142 4 22) QUAKING 
isoform 3 [Homo sapiens] POS=250 .(-SEQ ID NO: 57) 

G=1668: Mg= 883.3: Tg=29+-0 #J+(4,2) 

S=87(82,99) Mp=883.4 Tp=27 B7= 80/20 P=SPTQPIQL 

30 genpept PR=>gi | 1227484 0 | emb I CAC22308 . 1 | (AL354836) bA157P1.2 (cell 

membrane glycoprotein (surface antigen),. 110000 M(r) (GP110) ) [Homo 
sapiens] POS-257 (SEQ ID NO: 27) 

G=1739: Mg= 895.4: Tg=38+-0 #S+(2,1) #H+(6,3) 

35 S=80(81,79) Mp=895.4 Tp=34 B7-120/27 P=S PALPGLKL 

genpept PR=>gi | 2554 94 8 I gb|AAC517 90. 1 | (AF023 614) transmembrane 

activator and CAML interactor [Homo sapiens] POS=147 (SEQ ID NO: 28) 

G=1783: Mg= 899.4: Tg=16+-3 #H+(8,3) 
40 S=82(79,89) Mp=8 99.3 Tp=28 B7 = 60/24 P=APRTVALTA 

genpept PR=>gi | 3227 6 | emb | CAA2687 1 . 1 | (X03067) precursor [Homo 
sapiens] ] POS=9 (SEQ ID NO: 29) 
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G=2055: Mg= 927.3: Tg=31+-1 #$+(2,1) #H+(8,4) #1+0,2) 
S=94{97,90) Mp=927.4 Tp=29 B7=120/25 P=SPKLPVSSL 

genpept PR=>gi | 3399674 I gb | AAC28 918 . 1 I (AC005391) DRIL1 DNA binding 
protein homolog, partial CDS [Homo sapiens] POS=372 (SEQ ID NO: 30) 

5 

G=2688: Mg- 989.3: Tg=41+-0 #H+(5,3) 

S=90(94,81) Mp=989.4 Tp=39 B7=120/28 P-KPSLPFTSL 

genpept PR=>gi | 2198 90 | db j | BAA01857 . 1 | (D11086) interleukin 2 

receptor gamma chain [Homo sapiens] POS=3 { SEQ ID NO: 31) 

10 

G=2769: Mg= 999.5: Tg=35+-1 #H+(5,3) #I+(8,4) #J+(5,4) 
S=82(83,81) Mp= 999. 5 { 0.0) Tp= 39 A2= 5/18 P=LVMAPRTVL 
genpept PR=>gi | 9738918 | gb | AAF97847 . 1 | (AF129293) MHC class I antigen 
[Homo sapiens] POS=2 (SEQ ID NOs : 32 and 118) 

15 

G=3249: Mg=1050.4: Tg-36+-l #I+(12,5) 

S=96(99,90) Mp=1050.5 Tp=36 B7= 80/22 P=KPAFFAEKL 

genpept PR=>gi | 12654863 | gb (AAH01275 . 1 I AAH01275 (BC001275) annexin Al 
[Homo sapiens] ] POS=274 { SEQ ID NO: 33) 

20 

G=3329: Mg=1062.4: Tg=29+-0 #H+(6,3) 

S=89(89,89) Mp=1062.5( 0.1) Tp= 27 A2=0.0/9 P=IVKWDRDM 

genpept PR=>gi | 179318 I gb I AAA518 11 . 1 I (M17987) beta-2-microglobulin 
[Homo sapiens] POS=112 (SEQ ID NO: 162) 

25 

G-3426: Mg=1075.4: Tg=35+-1 #J+(4,3) 

S=78(78,81) Mp=1075.6 Tp=28 B7= 80/20 P=SPYQNIKIL 

genpept PR=>gi | 4164136 | gb | AAD08634 . 1 | (U53331) spermidine 

aminopropyltransferase [Homo sapiens] POS=128 (SEQ ID NO: 34) 

30 

G=3601: Mg=1104.5: Tg=ll+-5 #H+(4,2) #I+(8,4) 

S=89(92,83) Mp=1104.4 Tp=17 B7= 36/18 P=AASKERSGVSL 

genpept PR=>gi I 184072 | gb I AAA6318 6 . 1 | (M60747) histone HI [Homo 
sapiens] POS=50 (SEQ ID NO: 35) 

35 

G-3630: Mg=1114.4: Tg=48+-1 #S+(3,1) #H+(6,3) #I+(12,4) #J+(6,4) 
S=88(87,91) Mp=1114.4 Tp=51 B7=240/22 P=APFEPLASGIL 

genpept PR=>gi I 37 183 | emb | CAA26902 . 1 | (X03124) precursor [Homo 

sapiens] ] POS=2 (SEQ ID NO: 36) 

40 

G=3794: Mg =1194.5: Tg=47+-0 #S+(3,D #J+(4,2) 



It-faBdLXjgJB) Mp=1194.5 Tp=4 9 B7=360/31 P^APSGSLAVPLAVL 

genpept PR=>gi j 5262492 | emb | CAB45700 . 1 I (AL080080) hypothetical 

protein [Homo sapiens] POS = 9 (SEQ ID NO: 37) 



5 G=1002: B7 Mg= 754.3 Tg=33+-1 #I+(10,5) 
S=89(85,99) Mp=754.4 Tp=40 

genpept PR=>gi | 7 02332 9 I dbj I BAA9192 6.il (AK001821) 
product [Homo sapiens] 
POS=103 (SEQ ID NO: 205) 

10 

G=1007: B7 Mg= 755.4 Tg=39+-0 #H+(8,4) 
S=92(90,99) Mp=755.6 Tp=38 

genpept PR=>gi | 6807942 | emb | CAB7 0724 . 1 | (AL137396) 
protein [Homo sapiens] POS=29 ( SEQ ID NO: 206) 

15 

G=1062: B7 Mg= 769.4 Tg-28+-l #I+(14,4) #J+(3,2) 
S=79(78,84) Mp=769.4 Tp=29 B7=>lk/28 P=APRTVLL 

genpept PR=>gi | 2077 994 I emb | CAA734 7 4 . 1 | (Y13029) HLA-BM012 alphal & 
alpha2 domain [Homo 
20 sapiens] POS=5 (SEQ ID NO: 207) 



P=FFAEQL 
unnamed protein 



P=LRLQLL 
hypothetical 



G=1237: B7 Mg= 810.4 Tg=31+-3 #I+(11,5) 

S=81(84,74) Mp=810.2 Tp=22 B7=800/25 P=GPRSPSPL 

genpept PR=>gi | 12328443 | dbj j BAB21111 . 1 | (AB054538) PAPA-1 [Homo 
25 sapiens] POS=81 (SEQ ID NO:208) 



G=1261: A2 Mg= 815.2 Tg=53+-1 #F+(1,1) #G+(5,4) #K+(1,1) 
S=78(81,74) Mp=815.5 Tp=49 A2=285/27 P=GLGPVFLL 

genpept PR=>gi | 5084 92 I gb I AAA217 18 . 1 I (L14848) MHC class I-related 
30 protein [Homo sapiens] POS=2 (SEQ ID NO:209) 

G-1288: B7 Mg- 823.4 Tg=56+-4 #I+(1,1) #J+(9,4) 

S=74(69,86) Mp=823.5 Tp=51 A2= 84/29 P=LLPLIAAL 

genpept PR=>gi | 104 32829 | dbj | BAB13855 . 1 | (AK021613) unnamed protein 
35 product [Homo sapiens] POS=219 (SEQ ID NO: 210) 

G=1294: B7 Mg= 824.4 Tg=35+-1 #I+(6,3) 

S=80(85,69) Mp=824.4 Tp-38 B7=240/27 P=APAAVALVL 

genpept PR=>gi | 8 624 13 I gb | AAA82236 . 1 1 (U26727) a frameshift between 
40 exon 1 (0.18) and exon 2 changed the ORF of pl6INK4 gene [Homo 
sapiens] POS=39 (SEQ ID NO: 211) 

G=1361: B7 Mg= 838.3 Tg=32+-0 #H+(6,3) 
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S=85(85,88) Mp=838.3 Tp=31 B7=240/25 P=APAVTPAVL 

qenpept PR=>gi I 12 697 94 7 | dbj | BAB2 17 92 . 1 | (AB051488) KIAA1701 protein 
[Homo sapiens] POS=246 (SEQ ID NO: 212) 

5 G=1372: B7 Mg= 840.4 Tg=36+-1 #J+(8,4) 

3-78(82,70) Mp=840.5 Tp=40 B7= 80/20 P=FPHLITL 

genpept PR=>gi | 337 481 I gb I AAA60278 . 1 | (M34353) transmembrane 

tyrosine-specif ic protein kinase precursor [Homo sapiens] POS=1153 
(SEQ ID NO:213) 

10 

G=1397: B7 Mg= 844.5 Tg=41+-1 #J+(3,2) 

S=83 {77, 99) Mp=844 .5 Tp=41 B7= 80/22 P=KPFLGIGL 

genpept PR=>gi | 1100994 | gb| AAA828 69 . 1 | (L39793) nuclear factor p97 
[Homo sapiens] POS=651 (SEQ ID NO: 214) 

15 

G=1465: B7 Mg= 856.2 Tg=38+-2 #I+(6,4) 
S=94 (97, 89) Mp=856.3 Tp=36 A2= 1/18 

genpept PR=>gi I 297904 I emb I CAA51360 . 1 | (X72841) IEF 
sapiens] POS=102 (SEQ ID NO:215) 

20 

G=1473: B7 Mg= 858.3 Tg=37+-1 #H+(6,3) #I+(7,4) #J+(3,2) 
S=80(73,99) Mp=858.3 Tp=34 B7=240/25 P=APYGGPIAL 

genpept PR=>gi | 1214 02 90 | emb I CAC2 14 65 . 1 | (AL161656) bA12M19.2. 1 

(vacuolar protein sorting protein 16 (VPS16) ) [Homo sapiens] POS=41 
25 (SEQ ID NO:216) 

G=1535: B7 Mg= 867.5 Tg=30+-0 #H+(3,2) 
S=73(63,99) Mp=867.5 Tp=32 B7=240/25 

genpept PR=>gi | 6685007 | gb I AAF237 55 . 1 I AF1 984 54_1 

30 epithelial protein lost in neoplasm beta [Homo sapiens] 
ID NO:217) 

G=1656: B7 Mg= 882.3 Tg=30+-1 #H+(4,2) 
S=69(66,79) Mp=882.2 Tp=29 B7= 80/23 P=TPAPVPTSL 

35 genpept PR=>gi I 104 4 04 02 | db j I BAB15734 . 1 I (AK024444) FLJ00034 protein 
[Homo sapiens] POS=1178 (SEQ ID NO:218) 

G=1663: B7 Mg= 882.5 Tg=37+-1 #I+(7,4) #J+(10,4) 

S=84(87,77) Mp=882.5 Tp=37 B7=>lk/29 P=APRTVLLL 

40 genpept PR=>gi | 51 1780 | gb | AAA19924 . 1 | (U11264) HLA-B71 [Homo sapiens] 
POS=5 (SEQ ID NO:219) 

G=1736: B7 Mg= 895.2 Tg=34-f-0 #H+(5,3) 



P=GEFGGFGSV 
7442 [Homo 



P=APIAKVGVL 
(AF198454) 
POS=496 (SEQ 
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S=77(73,89) Mp=895.2 Tp=34 B7=120/26 P=NPASPPLSL 

qenpept PR=>gi I 1215669 I gb I AAC50391 . 1 | (U24169) JTV-1 [Homo sapiens] 
POS=127 ( SEQ ID NO: 220) 



5 G=1760: B7 Mg= 897.4 Tg=36+-3 #I+(8,5) 

S=74 (74,77) Mp=897.5 Tp=34 A2 = 23/24 P=AMLHDVVL 

genpept PR=>gi | 292033 I gb I AAA35854 . 1 J (L00635) farnesyl-protein 
transferase beta-subunit [Homo sapiens] POS=380 (SEQ ID NO: 221) 

10 G=1834: B7 Mg= 904.4 Tg=27+-0 #H+(3,2) 

S=80(73,99) Mp=904.4 Tp-25 B7= 18/23 P=APVLPHTAV 

genpept PR=>gi | 6599151 I emb | CAB63721 . 1 I (AL133572) hypothetical 
protein [Homo sapiens] POS=525 (SEQ ID NO: 222) 

15 G=1877: A2,B7 Mg= 908.4 Tg=29+-2 #D+(1,1) #E+(20,7) #F+<4,3) 
#EST+(2,1) #FR+(7,2) #G+(4,3) #I+(33,5) #J+<5,3) #K+(1,1) 
S=77(84,63) Mp=908.2 Tp=24 P-LPPPPPPGH 

genpept PR=>gi | 12 6532 65 I gb I AAH004 01 . 1 i AAH00401 (BC0004 01) splicing 
factor 3b, subunit 2, 

20 145kD [Homo sapiens] POS-17 (SEQ ID NO: 223) 

G=1918: B7 Mg= 912.4 Tg=30+-3 #I+(13 / 5) #J+(6,3) 

S=81(82,79) Mp=912.4 Tp=30 B7= 80/23 P=NPASKVIAL 

genpept PR=>gi t 13597 19 | emb I CAA64752 . 1 | (X95486) clathrin heavy chain 
25 polypeptide [Homo sapiens] POS=74 (SEQ ID NO: 224) 

G=1932: A2,B7 Mg= 913.4 Tg=32+-1 #FR+(2,1) #I+(4,2) 
S=92(90,99) Mp=913.5 Tp=34 P=KSFKLSGF 

genpept PR=>gi | 2198 94 | db j j BAA01392 . 1 | (D10522) 80K-L protein [Homo 
30 sapiens] POS=162 (SEQ ID NO: 225) 

G=1948: A2 Mg= 915.5 Tg=38+-0 #D+(2,1) #F+(11,7) 

S=87(96,69) Mp=915.5 Tp-37 A2=160/28 P=TLGNVLVTV 

genpept PR->gi | 7 58 22 96 I gb I AAF64 2 68 . 1 I AF2 08 8 54_1 (AF208854) BM-012 
35 [Homo sapiens] POS=91 (SEQ ID NO: 226) 

G=1992: A2 Mg= 920.4 Tg=45+-0 #E+(1,1) #F+(4,3) #FR+(1,1) 
S=77(82,66) Mp=920.5 Tp=4 3 A2-257/29 P=ALLAYTLGV 

genpept PR=>gi j 927067 | gb | AAC0938 6 . 1 I (L41498) longation factor 
40 1-alpha 1 [Homo sapiens] POS-73 (SEQ ID NO: 227) 



G=1995: B7 Mg= 920.7 Tg=17+-0 #I+(3,2) 
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S=72(74,69) Mp=920.4 Tp=21 B7= 6/22 P=LPKPPGRGV 

qenpept PR=>gi | 104 39422 | dbj | BAB154 99 . 1 I (AK026541) unnamed protein 
product [Homo sapiens] POS=335 (SEQ ID NO: 228) 

5 G=1998: A2 Mg= 921.4 Tg=53+-0 #D+(1,1) #E+(6,3) #F+(5 / 3) 

#EST+(2,2) #FR+(2,1) #G+(20,11) 

S=73(72,77) Mp=921.5 Tp=53 B7= 20/12 P=FVFPGELL 

genpept PR=>gi j 1478281 | gb I AAC50629 . 1 I (U53347) neutral amino acid 
transporter B [Homo sapiens] POS=89 {SEQ ID NO: 229) 

10 

G=1999: B7 Mg= 921.4 Tg=31+-1 #H+(3,2) #I+(6,3) #J+(1,1) 
S=75(82,59) Mp=921.3 Tp-31 B7=240/24 P=APAPRPSLL 

genpept PR=>gi | 3123906 I gb I AAC39729 . 1 I (AF038391) pre-mRNA splicing 
factor [Homo sapiens] POS=41 (SEQ ID NO:230) 

15 

G=2062: A2 Mg= 928.4 Tg=49+-0 #K+(4,3) 

S=75<74,79) Mp=928.4 Tp-50 A2 = 79/24 P=SLLAAPIML 

genpept PR=>gi | 6715510 I gb j AAF2 64 4 4 . 1 |AF220530_1 (AF220530) 

myo-inositol 1-phosphate synthase Al [Homo sapiens] POS=420 (SEQ ID 
20 NO:231) 

G=2066: B7 Mg= 929.4 Tg=35+-1 #H+(6,3) #I+(8,5) #J+(6,3) 
S=84(82,89) Mp=929.4 Tp=34 B7= 6/24 P=SPRLPVGGF 

genpept PR=>gi j 4 60711 | dbj | BAA05837 . 1 | (D28476) KIAA0045 [Homo 
25 sapiens] POS=1921 (SEQ ID NO: 232) 

G=2067: B7 Mg= 929.4 Tg=28+-0 #H+(3,2) 

S=83(80,90) Mp=928.3 Tp=25 B7= 30/20 P=GPI YPGHGM 

genpept PR=>gi j 13445484 | gb | AAK2 624 9 . 1 I AF212226_1 (AF212226) RPL24 
30 [Homo sapiens] POS=ll (SEQ ID NO: 233) 

G=2068: A2 Mg= 929.5 Tg=43+-1 #E+<2,2) #F+(20,7) #FR+(1,1) 
#G+(36,11) #K+(24,6) #L+(3,2) 

S=84 (89,74) Mp=929.5 Tp=45 A2= 11/22 P=SLVIGSILGA 

35 genpept PR=>gi I 228 1008 j dbj I BAA21560 . 1 I (D83492) Eph-family protein 
[Homo sapiens] POS=579 (SEQ ID NO: 234) 

G=2079: B7 Mg= 931.3 Tg=23+-1 #H+(4,2> 

S=77(87,54) Mp=931.3 Tp-25 B7=>lk/28 P=APRPAGSYL 

40 genpept PR=>gi | 27 07 610 | gb | AAB92363 . 1 | (AF000003) MHC class II 

transactivator type III [Homo sapiens] POS=5 (SEQ ID NO: 235) 



G=2082: B7 Mg= 932.2 Tg=36+-0 #H+<5,3) 



10 



15 
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35 



40 
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S=81(82,81) Mp=932.4 Tp=34 B7= 4/20 P=VPYGTPLSV 

qenpept PR=>gi | 1804 8 4 | gb | AAA98 113 . 1 | (M16541) cholinesterase (EC 
3.1.1.8) [Homo sapiens] POS-308 (SEQ ID NO:236) 

G=2083: A2 Mg= 932.3 Tg=56+-0 #D+(1,1) #F+<4,2) 

S=74(77,69) Mp=932.4 Tp=52 A2= 79/26 P = AL FG I PMAL 

genpept PR=>gi | 13623625 | gb | AAH06432 . 1 | AAH06432 (BC006432) Similar to 
caveolin 1, caveolae protein, 22kD [Homo sapiens] POS=74 (SEQ ID 
NO:237) 

G-2114: B7 Mg= 935.4 Tg=29+-0 #J+(4,2) 

S=74(68,88) Mp=935.4 Tp=30 B7= 80/21 P=SPNKLYTL 

genpept PR=>gi | 36130 | emb | CAA34 066 . 1 | (X15940) ribosomal protein L31 
(AA 1-125) [Homo sapiens] POS=98 ( SEQ ID NO: 238) 

G=2159: B7 Mg= 940.4 Tg-21+-2 #H+(8,3) #I+(12,4) #J+(5,2) 
S=86(85,89) Mp=940.3 Tp=26 B7- 90/28 P=APRQPGLMA 

genpept PR=>gi | 5531805 | gb | AAD4 4 4 77 . 1 | (AF078845) 16.7Kd protein [Homo 
sapiens] POS=49 (SEQ ID NO: 239) 

G=2161: B7 Mg= 940.5 Tg=29+-0 #H+(3,2) 

S=78(73,90) Mp=940.5 Tp-22 B7=240/24 P=APHLVGPHL 

genpept PR=>gi | 4 92 967 9 | gb | AAD34100 . 1 | AF1518 63_1 (AF1518 63) CGI- 105 
protein [Homo sapiens] POS=34 (SEQ ID NO:240) 

G=2167: A2 Mg= 941.3 Tg=36+-1 #G+(6,4) 

S=77(84,63) Mp=941.4 Tp=34 A2= 70/27 P=SLIPTSPQV 

genpept PR->gi I 7022115 | dbj | BAA914 93 . 1 | (AK001073) unnamed protein 
product [Homo sapiens] POS=238 (SEQ ID NO:241) 

G-2171: B7 Mg= 941.5 Tg=30+-1 #I+(1,1) #J+(3,2) 

S=76(72,86) Mp=941.5 Tp=41 B7- 8/19 P=VPKGWEII 

genpept PR=>gi | 10504249 I gb j AAG18012 . 1 I (AF248269) gag-pro-pol 

precursor protein [Homo sapiens] POS=100 (SEQ ID NO: 242) 

G=2190: B7 Mg= 943.4 Tg=41+-0 #I+(4,3) #J+(2,2) 

S=76(76,77) Mp=943.4 Tp=38 B7=800/28 P=SPRGFPLGL 

genpept PR=>gi j 7 959177 | dbj I BAA95982 . 1 I (AB040891) KIAA1458 protein 
[Homo sapiens] POS=94 (SEQ ID NO: 243) 

G=2200: B7 Mg= 944.3 Tg=25 + -1 #H+(5,3) #I+(7,3) #J+(8,2) 
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S=81(81,81) Mp-944.3 Tp-28 B7=240/26 P=APSRNGMVL 

genpept PR=>gi | 134 36332 \ gb | AAH04 954 . 1 | AAH04 954 (BC004954) Unknown 
(protein for MGC: 10897) [Homo sapiens] POS=2 (SEQ ID NO:244) 

5 G=2240: A2 Mg= 948.2 Tg=51+-0 #D+(1,1) #E+(2,1) #F+(4,3) 

#G+(7,5) #K+(1,1) 

S=65(66,63) Mp=948.4 Tp=52 A2=134/26 P=MLFPGS I AL 

genpept PR=>gi | 5748523 | emb | CAB53072 . 1 | (AL035071) dJ1085F17.2 

(microtubule-associated protein, 
10 RP/EB family, member 1 ) [Homo sapiens] POS=50 (SEQ ID NO: 245) 

G=2258: B7 Mg= 950.4 Tg=29+-1 #H+(5,4) #I+(3,2) #J+(3,2) 
3=85(83,90) Mp=950.5 Tp=28 B7=>lk/31 P=APRVPVQAL 

genpept PR=>gi | 12060822 | gb | AAG4 8253 . 1 | AF308285_1 (AF308285) 

15 serologically defined breast cancer antigen NY-BR-16 [Homo sapiens] 
POS=744 (SEQ ID NO:246) 

G=2286: B7 Mg= 953.4 Tg=36+-0 #H+(9,3) 

S=73(78,63) Mp=953.5 ' Tp=40 A2= 0/20 P=LLLPGELAK 

20 genpept PR=>gi | 30804 66 | emb I CAB11426 . 1 I (Z98744 ) histone H2B [Homo 
sapiens] POS=100 (SEQ ID NO: 247) 

G=2375: B7 Mg= 961.0 Tg=52+-1 #H+(4,3) 
S=80(82,77) Mp-961.5 Tp=60 

25 genpept PR->gi I 7022187 | dbj | BAA91513 . 1 | (AK001123) 
product [Homo sapiens] POS=45 (SEQ ID NO: 248) 

G=2408: B7 Mg= 964.5 Tg=33+-1 #I+(2,2) 

S=76(78,74) Mp=964.3 Tp-30 B7=800/25 P=GPRAPGPSLL 

30 genpept PR=>gi | 1945762 j emb | CAA68 877 . 1 | (Y07604) 

nucleoside-diphosphate kinase [Homo sapiens] POS=16 (SEQ ID NO: 249) 

G=2450: B7 Mg= 968.5 Tg=36+-1 #I+(10,5) #J+(6,3) 

S=80(82,76) Mp=968.4 Tp=33 B7=800/24 P=GPRTAALGLL 

35 genpept PR=>gi | 134 3 6152 | gb | AAH04 8 92 . 1 | AAH04 892 (BC004 8 92) 

reticulocalbin 2, EF-hand calcium binding domain [Homo sapiens] 
POS=4 (SEQ ID NO:250) 

G=2463: B7 Mg= 969.4 Tg=29+-2 #I+(9,4) #J+(2,1) 
40 S=77 (72, 90) Mp=969.5 Tp=30 B7= 4/19 P=S PNQKLLAV 

genpept PR=>gi | 5639663 | gb | AAD4 58 65 . 1 I AF083217_1 (AF083217) WD repeat 
protein WDR3 [Homo sapiens] POS=558 (SEQ ID NO: 251) 



P=FFSVFMAL 
unnamed protein 
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G=2465: A2 Mg= 969.5 Tg=39+-1 #E+(2,1) #FR+(2 / l) #G+(6,4) 
S=67(67,69) Mp=969.5 Tp=41 A2 = 84/29 P=LLASEVPQL 

genpept PR=>gi | 64 53540 1 emb | CAB61405 . 1 | (AL133088 ) hypothetical 
protein [Homo sapiens] POS=196 (SEQ ID NO: 252) 

5 

G=2476: B7 Mg= 970.5 Tg=20+-2 #H+(1,1) #I+(5,3) #J+(2 / l) 
5=71(71,72) Mp=970.3 Tp=23 B7= 2/23 P-APKRPPSAF 

genpept PR=>gi | 13097234 | gb| AAH0337 8 . 1 I AAH0337 8 (BC00337 8) 

high-mobility group (nonhistone chromosomal) protein 1 [Homo sapiens] 
10 POS=94 (SEQ ID NO:253) 

G=2483: B7 .Mg= 971.3 Tg=37+-1 #I+(4,2) #J+(2,1) 

S=85(80,99) Mp=971.4 Tp=35 B7= 1/23 P=VPAEPKLAF 

genpept PR=>gi | 4 17 6369 | gb | AAD0884 6 . 1 I (AC005058) similar to 60S 
15 ribosomal protein L7 ; similar to P18124 ( PID: dl3302 1 ) [Homo sapiens] 
POS=82 (SEQ ID NO:254) 

G=2486: B7 Mg= 971.5 Tg=49+-2 #J+(8,4) 

S=83(90,69) Mp=971.5 Tp=45 B7=240/25 P=APARLFALL 

20 genpept PR=>gi | 28 6021 | dbj | BAA02550 . 1 | (D13292) ryudocan core protein 
precursor [Homo sapiens] POS=2 (SEQ ID NO: 255) 

G=2514: A2 Mg= 973.4 Tg=33+-1 #F+(5,3) #G+(10,6) #K+(4,2) 

#L+ (1, 1) 

25 S=81{89,63) Mp=973.5 Tp=32 A2=344/27 P=KLGEIVTTI 

genpept PR=>gi I 178 983 I gb I AAA35552 . 1 | (M36340) ADP-ribosylat ion factor 
(ARF1) [Homo sapiens] POS=38 (SEQ ID NO: 256) 

G=2527: A2 Mg= 974.3 Tg=50+-0 #D+(1,1) #E+(15,7) #F+(9,4) 

30 #G+(22,10) #K+(10,5) 

S=86(93,72) Mp=974.5 Tp=51 B7= 6/15 P=MVDGTLLLL 

genpept PR=>gi | 306852 I gb | AAA52655 . 1 | (M20022) HLA-E class I protein 
precursor [Homo sapiens] POS=l (SEQ ID NO: 257) 

35 G=2550: B7 Mg= 976.3 Tg=27+-0 #H+(2,1) 

S=65(67,63) Mp=976.3 Tp=23 B7=120/27 P=TPSEPHPVL 

genpept PR=>gi | 128034 69 j gb | AAH0255 9 . 1 I AAH0255 9 (BC00255 9) 

high-glucose-regulated protein 8 [Homo sapiens] POS=381 (SEQ ID 
NO:258) 

40 



G=2591: B7 Mg= 980.3 Tg=29+-2 #H+(8,3) #I+(41,6) #J+(6,4) 
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S=82(92,60) Mp=980.1 Tp=28 B7= 7/21 P=APDAAPAPASI 

genpept PR=>gi | 12 652955 | gb I AAH00238 . 1 | AAH00238 (BC000238) 

hypothetical protein FLJ10415 [Homo sapiens] POS=4 (SEQ ID NO: 259) 

5 G=2632: A2 Mg= 984.2 Tg=40+-0 #F+(4,3) 

S=68(75,54) Mp-984.4 Tp=37 A2=656/32 P=ALPEDLVEV 

genpept PR=>gi | 190030 | gb | AAB0284 5 . 1 | (M22300) L-plastin polypeptide 
[Homo sapiens] POS=542 (SEQ ID NO:260) 

10 G=2633: B7 Mg= 984.4 Tg=25+-0 #H+(9,2) 

S=87(87,88) Mp=984.4 Tp=23 P=GSHSMRYF 

genpept PR=>gi | 1261808 I gb I AAC4 1 97 9 . 1 I (L41925) major 

histocompatibility complex [Homo sapiens] POS=25 (SEQ ID NO: 261) 

15 G=2659: B7 Mg= 986.5 Tg=33+-1 #I+(6,3) #J+(4,2) 

5=83(89,72) Mp=986.4 Tp=32 B7=360/27 P=APASPFRQL 

genpept PR=>gi | 338043 I gb I AAA73055 . 1 | (M69039) [Human pre-mRNA 
splicing factor SF2p32, complete sequence.], gene product [Homo 
sapiens] POS=19 (SEQ ID NO:262) 

20 

G=2679: A2 Mg= 988.3 Tg=33+-1 #E+(2,1) #FR+(2,2) 

S=78(83,69) Mp=988.4 Tp=32 P=NIKFVPAEA 

genpept PR=>gi | 7023874 I db j | BAA9211 6 . 1 I (AK002163) unnamed protein 
product [Homo sapiens] POS=79 (SEQ ID NO: 263) 

25 

G=2683: A2 Mg= 988.4 Tg=31+-0 #EST+(2,1) 

S=72(73,72) Mp=988.4 Tp=45 A2=118/25 P=MLAALNGLSV 

genpept PR=>gi | 8 0 902 9 | emb I CAA57 993 . 1 I (X82676) tyrosine phosphatase 
[Homo sapiens] POS=866 (SEQ ID NO:264) 

30 

G=2706: B7 Mg= 990.5 Tg=17+-1 #H+(5,2) #I+(6,4) 

5=81(89,63) Mp=990.2 Tp=20 B7=900/27 P=APRPPPKPM 

genpept PR=>gi | 1280354 9 I gb I AAH02 604 . 1 IAAH02604 (BC002604) ribosomal 
protein S26 [Homo sapiens] POS=107 (SEQ ID NO:265) 

35 

G=2723: B7 Mg= 993.4 Tg=34+-1 #I+(5,3) #J+(2,1) 

S=83{85,79) Mp=993.4 Tp=30 B7= 80/21 P=SPNAEIHIL 

genpept PR=>gi | 3582442 | dbj I BAA33063 . 1 I (AB017335) kinesin-like DNA 
binding protein [Homo sapiens] POS=542 (SEQ ID NO: 266) 



G=2741: B7 Mg= 995.6 Tg=46+-1 #H+(1,1) #I+(11,5) #J+(13,4) 
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S=79(87,63) Mp=995.6 Tp=44 B7=>lk/28 P=APRTVLLLL 

genpept PR=>gi j 30722 1 | gb | AAA3 62 30 . 1 | (M32317) HLA protein allele B7 
[Homo sapiens] POS=5 (SEQ ID NO: 267) 

5 G=2781: A2 Mg=1000.4 Tg=50+-0 #D+(2,2) #E+(7,4) #F+(5,4) 

#EST+(3,2) #FR+(4,2) #G+(6,5) #K+(10,6) 

S=69(69,72) Mp=1000.5 Tp=49 A2-238/26 P=FVLPELPSV 

genpept PR=>gi | 12652733 | gb i AAH00116 . 1 | AAH00116 (BC000116) Similar to 
KIAA0174 gene product [Homo sapiens] POS-292 (SEQ ID NO: 268) 

10 

G=2799: A2 Mg=1002.3 Tg=53+-0 #E+(9,5) #F+(1,1) #EST+(3,2) 

#FR+(5,2) #G+(2,2) #K+<1,1) 

S=82(89,66) Mp=1002.5 Tp=54 A2= 28/17 P=FSNFIFEV 

genpept PR=>gi I 36559 | emb I CAA35995 . 1 I (X51698) spasmolytic polypeptide 
15 [Homo sapiens] POS=109 (SEQ ID NO:269) 

G=2826: B7 Mg=1004 . 4 Tg=32+-0 #I+(2 / l) 

S=67(69,63) Mp=1004.4 Tp=29 B7= 72/24 P=APEEHPVLL 

genpept PR=>gi | 12803203 | gb I AAH024 09 . 1 | AAH02409 (BC002409) actin, beta 
20 [Homo sapiens] POS=97 (SEQ ID NO: 270) 

G=287 9: B7 Mg=1010. 4 Tg=36+-1 #1+ (l f 1) #J+ (5, 4) 

S=83(86,79) Mp=1010.5 Tp=36 B7- 0/20 P=SPKGKFSLF 

genpept PR=>gi | 8 97824 j gb I AAA69898 . 1 I (M80899) AHNAK gene product 
25 [Homo sapiens] POS=1150 (SEQ ID NO: 271) 

G=3016: A2 Mg=1024.4 Tg=42+-1 #E+{1,1) #F+(2,1) #EST+(1,1) 

#G+(4,3) #K+(9,5) 

5=76(75,81) Mp=1024.5 Tp=36 A2= 98/27 P=ALHDILTEI 

30 genpept PR=>gi | 14 98257 | gb | AAB09784 . 1 | (L07540) replication factor C, 
36-kDa subunit [Homo sapiens] POS=276 (SEQ ID NO: 272) 

G=3019: B7 Mg=1024 . 6 Tg=34+-2 #I+(6,3) #J+(8,4) 
S=87(90,81) Mp=1024.6 Tp=35 B7= 80/23 

35 genpept PR=>gi I 1401055 I gb I AAB18675 . 1 I (U38817) SUPT4H 
POS=93 (SEQ ID NO:273) 

G=3033: A2 Mg=1025.4 Tg=55+-0 #F+(1,1) #G+(3,2) 
S=79(79,81) Mp=1025.6 Tp=51 A2=>lk/26 P=SLIDQFFGV 

40 genpept PR=>gi | 13097 696 ( gb I AAH03556 . 1 I AAH03556 (BC003556) ubiquitin 
specific protease 14 ( tRNA-guanine transglycosylase ) [Homo sapiens] 
POS=241 (SEQ ID NO:274) 



P=LPQGIVREL 
[Homo sapiens] 
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G=3046: A2 Mg=1026.6 Tg=48+-0 #G+(1,1) #K+(1,1) 

S=67(58,90) Mp=1026.6 Tp=49 A2=437/24 P=MLFGHPLLV 

genpept PR=>gi | 12 653165 I gb I AAH00350 . 1 | AAH00350 (BC000350) ubiquitin 
specific protease 11 [Homo sapiens] POS=338 { SEQ ID NO: 275) 

5 

G=3102: A2 Mg=1032.4 Tg=52+-1 #E+(31,9) #F+(7,4) #EST+(5,2) 

#FR+(8,3) #G+(30,11) #K+(28,6) #L+(7,2) 

S=80(80,81) Mp=1032.4 Tp=47 A2=166/26 P=ALPEI FTEL 

genpept PR=>gi | 306900 | gb I AAA19696 . 1 | (L19161) translation initiation 
10 factor eIF-2 gamma subunit [Homo sapiens] POS=363 (SEQ ID NO: 276) 

G=3151: B7 Mg=1038.3 Tg=43+-0 #H+(4,3) #I+(1,1) 

3=78(69,99) Mp-1038.4 Tp=42 B7=120/25 P=TPWQPPTVL 

genpept PR=>gi I 7021021 | dbj I BAA91355 . 1 | (AK000742) unnamed protein 
15 product [Homo sapiens] POS=348 ( SEQ ID NO: 277) 

G=3155: B7 Mg=1038.5 Tg=33+-1 #I+(1,1) #J+(2,1) 

S=70(66,81) Mp=1038.4 Tp=35 B7=>lk/27 P=LPRQPPMSL 

genpept PR=>gi | 4 4 0177 | gb I AAC03568 . 1 I (U01184) flightless-I homolog 
20 [Homo sapiens] POS-881 (SEQ ID NO: 278) 

G=3174: B7 Mg=1040.5 Tg=42+-1 #J+(6,3) 

S=74(68,90) Mp=1040.5 Tp=42 B7= 80/22 P=SPNLRLLDL . 

genpept PR->gi I 104 39422 | dbj | BAB154 99 . 1 I (AK026541) unnamed protein 
25 product [Homo sapiens] POS-377 (SEQ ID NO: 279) 

G=3178: A2 Mg=1040.5 Tg=22+-0 #F+(3,2) 

S=71(68,81) Mp=1040.5 Tp=29 A2=485/27 P=ILHDDEVTV 

genpept PR=>gi | 13097207 | gb I AAH03369 . 1 | AAH03369 (BC003369) ribosomal 
30 protein, large, PI [Homo sapiens] POS=15 (SEQ ID NO: 280) 

G=3182: A2 Mg=1041.3 Tg=69+-0 #FR+(2,1) 

S=65(62,72) Mp=1040.8 Tp=71 A2=343/30 P=FLLPLI IVL 

genpept PR=>gi | 7271191 I emb | CAB77667 . 1 | (AJ249778) mrl protein [Homo 
35 sapiens] POS=7 (SEQ ID NO:281) 

G=3200: B7 Mg=1043.5 Tg=32+-0 #H+(1,1) #J+(4,2) 

S=76(78, 72) Mp=1043.4 ' Tp=34 B7= 24/22 P=LPDERTISL 

genpept PR=>gi | 180599 | gb | AAA35691 . 1 I (L02547) cleavage stimulation 
40 factor [Homo sapiens] POS=370 (SEQ ID NO: 282) 

G=3212: A2 Mg=1045.4 Tg=44+-0 #K+(6,4) 
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S=79{91,54) Mp=1045.5 Tp=55 A2 = 89/25 P=SLLLNMLEI 

qenpept PR=>gi | 5381289 I gb I AAD42925 . 1 | (AF075290) gap-junction protein 
alpha 3 [Homo sapiens] POS=214 (SEQ ID NO: 283) 

5 G=3221: B7 Mg=1047.3 Tg=34+-0 #H+(2,1) 

S=75(77,72) Mp=1046.5 Tp=36 B7=120/24 P=LPAWPHRGL 

genpept PR=>gi j 7022196 I dbj | BAA91515 . 1 | (AK001129) unnamed protein 
product [Homo sapiens] POS=387 (SEQ ID NO: 284) 

10 G=3236: A2 Mg=1049.4 Tg=67+-2 #D+(16,5) #E+(9,6) #F+(11,5) 

#EST+(2,1) #FR+(4,3) #G+(11,5) #K+(5,3) 

S=69(68,72) Mp=104 9.6 Tp=67 A2=525/27 P=ALWGFFPVL 

genpept PR=>gi | 12005 4 95 | gb I AAG44 477 . 1 |AF24 272 9_1 (AF2 4 272 9) HT022 
[Homo sapiens] POS=4 ( SEQ ID NO: 285) 

15 

G=3289: B7 Mg=1056.5 Tg=35+-1 #I+(6,3) #J+(6,4) 

S=88(92,81) Mp=1056.6 Tp-35 B7=120/26 P=YPKRPLLGL 

genpept PR=>gi | 9931361 I gb | AAG02 169 . 1 jAF214 731_l (AF2147 31) 

ATP-dependent RNA helicase [Homo sapiens] POS=383 (SEQ ID NO: 286) 

20 

G=3300: B7 Mg=1058.4 Tg=38+-1 #I+(4,3) #J+(6,4) 
S=88(91,81) Mp=1058.5 Tp=41 B7=240/24 

genpept PR=>gi | 5 911980 | emb | CAB55960 . 1 I (AL117492 ) 
protein [Homo sapiens] POS=800 (SEQ ID NO:287) 

25 

G=3308: B7 Mg=1059.4 Tg=35+-1 #I+(2 / l) #J+(4,3) 
S=87 (86,90) Mp=1059.4 Tp=33 B7= 80/23 P^NPAENFRVL 

genpept PR->gi | 35038 | emb I CAA40736 . 1 I (X57500) nuclear factor IV [Homo 
sapiens] POS=489 (SEQ ID NO:288) 

30 

G-3344: A2 Mg=1064.5 Tg=37+-0 #F+(2,2) #K+(1,1) 

S=65(66,63) Mp=1063.7 Tp=39 A2=183/28 P=VLDDKLYVV 

genpept PR=>gi | 4894 624 | gb | AAD32565 . 1 | (AF123320) lymphocyte 

activation-associated protein [Homo sapiens] POS=19 (SEQ ID NO: 289) 

35 

G=3360: A2 Mg=1065.4 Tg=37+-1 #D+{1,1) #EST+(2,1) #FR+(3,1) 

#G+(3,3) #K+(2,1) 

S=81(82,81) Mp=1065.5 Tp=35 A2= 50/16 P=NQFPGFKEV 

genpept PR=>gi | 134 4 6037 | emb | CAB38777 . 2 I (AL034428) dJ705D16.1 (small 
40 nuclear ribonucleoprotein polypeptide B 1 1 ) [Homo sapiens] POS=172 
(SEQ ID NO:290) 



P=APFLRNVEL 
hypothetical 



G=3368: B7 Mg=1066.4 Tg=21+-3 #I+(7,2) #J+(6,3) 



99 

S=78(89,54) Mp-1066.4 Tp=25 B7=120/25 

genpept PR=>gi | 1945611 | dbj | BAA19749. 1 I (AB003103) 
subunit p55 [Homo sapiens] POS=417 (SEQ ID NO:291) 



P=RPKDPNNLL 
26S proteasome 



10 



15 



20 



G=3369: A2 Mg=1066.5 Tg=58+-1 #FR+(4,2) 

S=65(69,57) Mp=1066.7 Tp=60 A2=298/20 P=LVFDALIYI 

genpept PR=>gi | 1504 002 I db j I BAA13200 . 1 I (D86964) similar to a human 
major CRK-binding protein DOCK180. [Homo sapiens] POS=684 (SEQ ID 
NO: 2 92) 

G=3381: B7 Mg=1067.8 Tg=37+-0 #I+(2,1) #J+(1,1) 

S=72(72,72) Mp=1067.6 Tp=33 B7=120/23 P=SPHIPYKLL 

genpept PR=>gi | 568 94 63 | dbj | BAA83015 . 1 | (AB028986) KIAA1063 protein 
[Homo sapiens] POS=305 (SEQ ID NO: 293) 

G=3393: A2 Mg=1070.5 Tg=53+-0 #G+(7,5) 

S=76(74,81) Mp-1070.6 Tp=49 A2= 67/19 P=YVVDI FTTL 

genpept PR->gi | 9957 532 I gbj AAG094 01 . 1 I AF17 9353_1 (AF17 9353) inward 
rectifier potassium channel KirS.l [Homo sapiens] POS=57 (SEQ ID 
NO:294) 



G=3398: A2 Mg=1070.6 Tg=35+-0 #F+(2,1) 

5=70(78,54) Mp=1070.6 Tp=35 A2=656/27 P=RLQEELIAV 

genpept PR=>gi | 309347 6 | gb | AAC16031 . 1 | (AF008915) EVI-5 homolog [Homo 
25 sapiens] POS=506 (SEQ ID NO:295) 

G=3403: A2 Mg=1071.5 Tg=35+-1 #E+(1,1) #K+(1,1) 

S=73(67,90) Mp=1071.5 Tp=34 A2->lk/25 P=NLMEQPIKV 

genpept PR=>gi I 112288 9 | emb 1 CAA92522 . 1 I (Z68228) plakoglobin [Homo 
30 sapiens] POS=5 (SEQ ID NO: 296) 

G=3422: A2 Mg=1074.4 Tg=46+-0 #G+(7,5) #K+(1,1) 

S=72(73,72) Mp=1074.4 Tp=47 A2=128/21 P=FMLPDPQNI 

genpept PR=>gi | 12 620198 I gb | AAG60614 . 1 | AF288394_1 (AF288394) Clorf 19 
35 [Homo sapiens] POS=185 (SEQ ID NO:297) 

G=3425: A2 Mg=1075.3 Tg=29+-0 #FR+(2 / l) 

S=70(73,63) Mp=1075.7 Tp=27 A2=201/26 P=TLHDQVHLL 

genpept PR=>gi | 31234 | emb t CAA27284 . 1 | (X03635) oestrogen receptor 
40 [Homo sapiens] POS=371 (SEQ ID NO: 298) 



G=3439: A2 Mg=1076.6 Tg=52+-0 #F+(3,3) 
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S=79(89,56) Mp-1076.5 Tp=51 A2 = 46/20 P-ILSPAGQIFM 

genpept PR=>gi | 1 0 6367 0 | gb | AAA8 1368 . 1 | (U38896) zinc finger protein 
C2H2-171 [Homo sapiens] POS=324 (SEQ ID NO: 299) 

5 G-3455: A2 Mg=1078.5 Tg-42+-l #F+(1,1) #EST+(1,1) #FR+(1,1) 

3=76(71,90) Mp=1078.6 Tp=4 6 A2=403/26 P=FLSELTQQL 

genpept PR=>gi | 312334 j emb I CAA80598 . 1 | (Z23063) macrophage migration 
inhibitory factor [Homo sapiens] POS=19 (SEQ ID NO: 300) 

10 G=3456: B7 Mg=1078.5 Tg=38+-1 #H+(3,2) #J+(3,2) 

S=72(72,74) Mp=1078.5 Tp=36 B7=>lk/27 P=SPRFPAQYL 

genpept PR=>gi | 1403336 | emb i CAA67024 . 1 | (X98378) SKI2W protein [Homo 
sapiens] POS=1017 (SEQ ID NO: 301) 

15 G=3492: A2 Mg=1083.5 Tg=42+-1 #E+(3,2) #EST+(1,1) #G+(10,5) 

S=83(87,74) Mp=1083.5 Tp=43 A2 = 177/25 P=RLWGE PVNL 

genpept PR=>gi | 1666075 | emb | CAA66942 . 1 | (X98296) ubiquitin hydrolase 
[Homo sapiens] POS-1665 (SEQ ID NO: 302) 

20 G=3516: A2 Mg=1088.4 Tg=33+-0 #G+(3,2) 

S=69(67,76) Mp=1088.5 Tp=30 A2=656/29 P^TLTEEGVIKV 

genpept PR=>gi | 13160 98 8 I gb I AAK134 4 4 . 1 I AF325353_1 (AF325353) G 

protein-binding protein CRFG [Homo sapiens] POS=322 (SEQ ID NO: 303) 

25 G=3516: A2 Mg=1088.4 Tg=33+-0 #G+(3,2) 

S=69(67,76) Mp=1088.5 Tp=33 A2=656/27 P=TLTEEGVIQV 

genpept PR=>gi | 3153873 | gb | AAC24 364 . 1 | (AF065393) putative G-binding 
protein [Homo sapiens] POS=306 (SEQ ID NO:304) 

30 G=3537: A2 Mg=1092.3 Tg=39+-1 #D+(1,1) #F+(2,2) #G+(14,10) 

#K+(3,2) 

S=75(75,76) Mp=1092.5 Tp=46 A2=216/23 P=FLQEGDLISA 

genpept PR=>gi I 7023478 | dbj | BAA91977 . 1 | (AK001916) unnamed protein 
product [Homo sapiens] POS=134 (SEQ ID NO: 305) 

35 

G=3540: A2 Mg=1093.5 Tg=41+-0 #F+(2,1) 

S=65(66,63) Mp=1093.6 Tp=3 9 A2=156/27 P-KILDYEVTL 

genpept PR=>gi I 186354 | gb I AAA59155 . 1 I (M57230) membrane glycoprotein 
130 [Homo sapiens] POS=362 (SEQ ID NO: 306) 

40 

G=3550: A2 Mg=1095.3 Tg=52+-0 #G+(2,2) 



101 

S=66<60, 81) Mp=1095.7 Tp=54 A2=>lk/2 9 P=ILFKSIFEV 

qenpept PR=>gi | 65634 08 I gb I AAD37 802 . 2 I AF151980_1 (AF15198 0) connexin 
43 [Homo sapiens] POS=159 ( SEQ ID NO: 307) 

5 G=3586: B7 Mg=1102.5 Tg=36+-1 #H+(2,1) #I+(1,1) #J+(2,2) 

S=77(76,81) Mp=1102.4 Tp=31 B7=>lk/28 P-NPRIPYTEL 

genpept PR=>gi | 3220164 | gb | AAC397 69 . 1 | (AF029777) hGCN5 [Homo sapiens] 
POS=656 (SEQ ID NO:308) 

10 G=3610: A2 Mg=1108.3 Tg=44+-1 #D+(5,2) #E+(6,4) #F+(8,5) 

#EST+(1,1) #FR+{2 / l) #G+(12,8) #K+(1,1) 

S=66 (66, 66) Mp=1108.4 Tp=50 A2=178/21 P=FLADPSAFVAA 

genpept PR=>gi | 12 654 583 | gb I AAH01 127 . 1 | AAH01127 (BC001127) ribosomal 
protein, large, P0 [Homo sapiens] POS=268 (SEQ ID NO: 309) 

15 

G=3612: A2 Mg=1108.4 Tg=54+-1 #F+(4,3) #G+(2,2) 
S=66 (68, 63) Mp=1108 . 7 Tp=57 A2=656/27 

genpept PR=>gi | 799177 | gb | AAA80488 . 1 | (U22055) 100 kDa 
[Homo sapiens] POS=186 (SEQ ID NO: 310) 

20 

G=3619: B7 Mg=1110.7 Tg=29+-0 #I+(4,2) 
S=81(84,74) Mp=1110.4 Tp=25 B7= 80/24 P=RPRPDDLEI 

genpept PR=>gi | 18 1209 I gb I AAA52 131 . 1 | (M65188) connexin 43 [Homo 
sapiens] POS=374 (SEQ ID NO: 311) 

25 

G=3636: B7 Mg=1115.2 Tg=25+-1 #H+(4,2) 
S=69(69,72) Mp=1115.3 Tp-25 

genpept PR=>gi | 2 65222 | gb j AAB25312 . 1 | (S54761) beta 
[human, SK-MEL-33 cells, Peptide Mutant, 101 aa] 
30 POS=87 (SEQ ID NO:312) 



P=LLLPDYYLV 
coactivator 



P=YTEFTPTEK 
2 -microglobulin 
[Homo sapiens] 



G=3647: B7 Mg-1117.5 Tg=36+-1 #I+(4,2) 

S=75 (78, 68) Mp=1117 . 5 Tp=31 B7= 0/20 P=KPKTPSLTVF 

genpept PR=>gi | 36124 | emb I CAA4 5125 . 1 | (X63564) RNA polymerase II 
35 largest subunit [Homo sapiens] POS=1133 (SEQ ID NO: 313) 

G=3654: A2 Mg=1119.6 Tg=42+-0 #E+(1,1) #F+(2,2) #EST+(3,1) 

#FR+(1,1) #G+(1,1) 

S=82(90,66) Mp=1119.7 Tp-38 B7= 4/14 P=LFRTQLKTL 

40 genpept PR=>gi | 12 65254 9 I gb I AAH00014 . 1 | AAH00014 (BC000014) Unknown 
(protein for MGC:3265) [Homo sapiens] POS=326 (SEQ ID NO: 314) 



G=3656: A2 Mg=1120.4 Tg=38+-1 #E+(5,3) #F+(1,1) #G+(4,3) 
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S=82(87,72) Mp=1120.6 Tp=40 A2=200/23 P=KMLRLSYPL 

genpept PR=>gi I 4 53952 9 | emb I CAA33099 . 1 I (X14974) CDld antigen [Homo 
sapiens] POS-104 (SEQ ID NO: 315) 

5 G=3659: A2 Mg=1122.5 Tg=40+-1 #G+(3,3> 

5-81(81,83) Mp=1122.5 Tp=38 A2=>lk/30 P=TLLEDGTFKV 

genpept PR->gi | 12804145 I gb| AAH02927 . 1 | AAH02927 (BC002927) HSCARG 
protein [Homo sapiens] POS=23 ( SEQ ID NO: 316) 

10 G=3675: A2 Mg=1126.8 Tg=44+-0 #EST+{2,1) 

S=70(69,74) Mp=1126.6 Tp=42 A2= 61/24 P=HLLEEPI YL 

genpept PR=>gi | 5726241 I gb | AAD48376 . 1 I AF125188_1 (AF125188) adenosine 
deaminase acting on tRNA 1 [Homo sapiens] POS=318 (SEQ ID NO: 317) 

15 G=3676: A2 Mg=1127.3 Tg=54+-0 #K+(1,1) 

S=69(61,90) Mp=1127.6 Tp=47 A2=616/28 P=SLLDEFYKL 

genpept PR->gi | 12 804 617 | gb I AAH01731 . 1 | AAH01731 (BC0017 31) membrane 
component, chromosome 11, surface marker 1 [Homo sapiens] POS=131 
(SEQ ID NO:318) 

20 

G=3679: B7 Mg-1127.5 Tg=36+-1 #I+(1,1) #J+(7,4) 

S-73(76,68) Mp=1127.5 Tp=37 B7-800/26 P=SPRENILVSL 

genpept PR=>gi I 4 808601 | gb | AAD29870 . 1 | AF097 514_1 (AF0 97514) 

stearoyl-CoA desaturase [Homo sapiens] POS=281 (SEQ ID NO: 319) 

25 

G=3689: A2 Mg=1132.3 Tg=45+-0 #F+(1,1) #G+(11,8) 

S=72(74,70) Mp=1132.4 Tp=44 A2=118/22 P=ALIEVPDGFTA 

genpept PR=>gi | 12 65 4 4 73 I gb I AAH01064 . 1 IAAH01064 (BC0010 64) 

hypothetical protein DKFZp547H084 [Homo sapiens] POS=193 (SEQ ID 
30 NO: 320) 

G=3698: A2 Mg-1134.6 Tg=54+-1 #E+(6,2) #FR+(5,2) #G+(6,4) 
S=74 (74,76) Mp=1134.6 Tp=47 A2=>lk/34 P=SLLDRFLATV 

genpept PR=>gi I 12653303 | gb I AAH00420 . 1 I AAH00420 (BC000420) cyclin I 
35 [Homo sapiens] POS=71 (SEQ ID NO: 321) 

G=3709: B7 Mg=1140.4 Tg=30+-1 #H+(4,2) #I+(5,3) 

S=81(90,60) Mp=1140.6 Tp=30 B7= 24/24 P=QPDQTRIVAL 

genpept PR=>gi | 3399667 | gb I AAC28 91 3 . 1 I (AC005393) FBRL_HUMAN; 34 KD 
40 NUCLEOLAR SCLERODERMA ANTIGEN [Homo sapiens] POS=236 (SEQ ID NO: 322) 

G=3745: B7 Mg=1153.6 Tg=45+-1 #I+(7,3) #J+(12,4) 
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S=84(92,66) Mp=1153.6 Tp=4 6 B7= 60/24 P=APRTVLLLLSA 

genpept PR=>gi | 4 56654 0 I gb I AAD23392 . 1 | (AF110255) MHC class I antigen 
[Homo sapiens] POS=5 (SEQ ID NO: 323) 

5 G=3755: A2 Mg=1158.7 Tg=55+-0 #K+(2,1) 

S=73(73,74) Mp=1158.5 Tp=53 A2=581/23 P=YI FEEPFTI 

genpept PR=>gi | 11017 58 I gb | AAA82 938 .11 (M5 9741) phosphof ructokinase-M 
[Homo sapiens] POS=594 (SEQ ID NO: 324) 

10 G=3760: B7 Mg=1160.4 Tg=27+-0 #H+(3,2) 

S=81(83,77) Mp=1160.5 Tp=31 B7= 80/23 P=TPSLVKSTSQL 

genpept PR=>gi | 38 4 32 | emb | CAA4 9533 . 1 | (X69908) P2 gene for c subunit 
of mitochondrial ATP synthase [Homo sapiens] POS=10 (SEQ ID NO: 325) 

15 G=3800: A2 Mg=1198.4 Tg=45+-0 #£+(1,1) #G+(2,2) #K+(1,1) 

S=76(78,74) Mp=1198.7 Tp=46 A2=>lk/25 P=VLI DVGTGYYV 

genpept PR=>gi I 12 957 177 | dbj | BAB32 64 6 . 1 | (AB055805) MM-1 gamma [Homo 
sapiens] POS=37 (SEQ ID NO: 326) 

20 G-3817: B7 Mg=1240.6 Tg=37+-0 #H+(4,2) 

S=81(82,79) Mp=1240.6 Tp=37 B7= 80/23 P-IPYHSEVPVSL 

genpept PR=>gi | 3384 43 | gb I AAA60580 . 1 I (M96803) beta-spect rin [Homo 
sapiens] POS=2247 (SEQ ID NO: 327) 

25 G=3833: B7 Mg=1271.5 Tg=36+-1 #I+(5,3) #J+(1,1) 
S=81(94,52) Mp=1271.5 Tp=21 B7=240/25 

genpept PR=>gi | 2117159 I emb I CAA73697 . 1 | (Y13247) FB19 
sapiens] POS=202 { SEQ ID NO: 328) 

30 G-1497: A2 Mg- 881.5 Tg=45+-1 #0+(7,3) 
S=84(78,99) Mp-881.6 Tp-4 9 

genpept PR=>gi | 1155 94 90 | dbj | BAB18 8 5 9.il 
sapiens] P0S-288 (SEQ ID NO: 329) 

35 G=2420: A2 Mg=1009.3 Tg=53+-0 #0+(4,3) 

S=79(79,79) Mp=1009.6 Tp=50 A2=739/27 P=LLLPDYYL 

genpept PR=>gi | 7 99177 | gb | AAA804 88 . 1 j (U22055) 100 kDa coactivator 
[Homo sapiens] POS=186 (SEQ ID NO: 330) 

40 G=2716: A2 Mg=1048.4 Tg=31+-0 #0+(8,4) 

S=75(82,60) Mp=1048.7 Tp=36 A2=272/29 P=VLVPGHLQSV 

genpept PR=>gi | 6002 919 I gb | AAF001 99 . 1 | AF16227 8_1 (AF1 62278) 



P=APSHAKFRSTGL 
protein [Homo 



A2=567/27 P=VILDLPLV 
(AB051901) VDUP1 [Homo 
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Skn-la/Epoc-l/Oct-11 POU transcription factor [Homo sapiens] POS=98 
(SEQ ID NO:331) 

G=3197: A2 Mg=1244.5 Tg=45+-0 #0+(2,l) 
5 S=77(82,66) Mp=1244.7 Tp=46 A2=260/25 P=LLDPNVKSIFV 

genpept PR=>gi | 12654 4 85 | gb I AAH01072 . 1 | AAH01072 (BC001072) Unknown 
.(protein for MGC:2683) [Homo sapiens] POS=24 (SEQ ID NO:332) 



G=1330: HLA=Cw4 Mg= 926.3 Tg=49+-2 #M+(2,2) #N+(2,2) 
10 S=84(87,79) Mp=926.4 Tp=53 P=IFDLGGGTF 

genpept PR=>gi | 13273304 | gb| AAK17898 . 1 | AF352832_1 (AF352832) 

constitutive heat shock protein 70 [Homo sapiens] POS-197 (SEQ ID 
NO:333) 



15 G=1332: HLA=Cw4 Mg= 928.3 Tg=43+-4 #M+(3,2) #N+(6,3) 

S=80(89,59) Mp=928.5 Tp=45 P=VFDPVPVGV 

genpept PR=>gi | 307383 I gb I AAB4 8855 . 1 | (L13848) RNA helicase A [Homo 
sapiens] POS=697 (SEQ ID NO: 334) 

20 G=1363: HLA=Cw4 Mg= 961.3 Tg=54+-2 #M+(9,4) #N+(22,3) 

S=91(92,89) Mp=961.4 Tp=58 P-FFESFGDL 

genpept PR=>gi I 6425025 I gb I AAF08259 . 1 I (AF186607) beta-globin [Homo 
sapiens] POS-42 (SEQ ID NO: 335) 

25 G-1365: HLA=Cw4 Mg= 964.3 Tg=44+-3 #M+(9,4) #N+(14,3) 

S=82(94,54) Mp=964.3 Tp=4 6 P=AFDPTSTLL 

genpept PR->gi | 12 803 1 15 I gb I AAH02 3 61 . 1 I AAH02361 (BC002361) Similar to 
transducin (beta) -like 3 [Homo sapiens] POS=2 (SEQ ID NO: 336) 

30 G=1383: HLA=Cw4 Mg= 995.3 Tg-47+-2 #M+(7,4) #N+(4,2) 

S=74(86, 49) Mp=995.5 Tp-42 P=HFDMAVYL 

genpept PR=>gi | 18 6913 | gb | AAA5 94 8 6 . 1 j (M61951) laminin Bl [Homo 
sapiens] POS=345 (SEQ ID NO: 337) 

35 G=1399: HLA=Cw4 Mg=1008.2 Tg=45+-1 #M+(7,4) #N+(2,1) 

5=83(96,54) Mp-1008 . 3 Tp=51 P=SFDTGFTSF 

genpept PR=>gi I 54 4 1950 | gb | AAD43194 . 1 | AF075601_1 (AF075601) heat shock 
J2 protein [Homo sapiens] POS=156 (SEQ ID NO: 338) 

40 G=1401: HLA=Cw4 Mg=1011.3 Tg=38+-2 #M+(18,4) #N+(2,2) 

S=80(92,54) Mp=1011.5 Tp=44 P-KFDDGAVFL 

genpept PR->gi | 133654 93 I dbj I BAB39107 . 1 | (AB042196) myosin phosphatase 
target subunit 1 [Homo sapiens] POS=37 (SEQ ID NO: 339) 
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G=1410: HLA=Cw4 Mg=1018.3 Tg=59+-1 #M+(8,4) #N-f(17,3) 
S=77(88,54) Mp=1018.4 Tp=54 P=SMDPLPVFL 

genpept PR=>gi | 34 13884 | dbj | BAA32306 . 1 | (AB007930) KIAA0461 perotein 
5 [Homo sapiens] POS=666 (SEQ ID NO: 340) 

G=141: HLA=Cw4 Mg-1019.4 Tg=25+-0 #M+(2,1) 

S=65(66,63) Mp=1019.5 Tp=36 P=VFDEAI RAV 

genpept PR=>gi | 4490835 I emb I CAB38 832 . 1 | (AL02257 6) dJ2 0J23. 1 

10 ( RAS- RELATED C3 BOTULINUM TOXIN SUBSTRATE 1 (P21-RAC1) (RAS-LIKE 
PROTEIN TC25)) [Homo sapiens] POS=168 (SEQ ID NO:341) 

G-1422: HLA=Cw4 Mg=1028.4 Tg=54+-1 #M+(3,2) #N+(1,1) 
S=76(82,63) Mp=1028.3 Tp=53 P=LFDPMTGTF 

15 genpept PR=>gi j 239578 | gb I AAB204 13 . 1 | (S67859) general transcription 
factor HE 56 kda subunit; TFIIE 56 kda subunit [Homo sapiens] 
POS=144 (SEQ ID NO: 342) 

G=1426: HLA=Cw4 Mg=1032.4 Tg=43+-14 #M+(5,3) 
20 S=80(84,72) Mp=1032.4 Tp=37 P=SFDAHLTEL 

genpept PR=>gi | 2 661083 | gb | AAB8818 9 . 1 j (AF035319) similar to 
S-adenosylhomocysteine hydrolase [Homo sapiens] POS=187 (SEQ ID 
NO: 34 3) 

25 G=1430: HLA=Cw4 Mg=1035.4 Tg=36+-6 #M+(15,4) #N+(10,3) 

S=83(88,72) Mp=1035.5 Tp=42 P=VFDKTLAEL 

genpept PR=>gi | 5101772 | emb I CAB45135 . 1 | (AJ242978) p621 . [Homo sapiens] 
POS=70 (SEQ ID NO:344) 

30 G-1449: HLA=Cw4 Mg=1051.4 Tg=31+-6 #M+(15,4) #N+(9,2) 

S=86(92,72) Mp=1051.5 Tp=38 P=IFDSKVTEI 

genpept PR=>gi I 14 036118 | emb | CAC38 655 . 1 I (AX119163) unnamed protein 
product [Homo sapiens] POS=21 (SEQ ID NO: 345) 

35 G=1457: HLA=Cw4 Mg=1056.2 Tg=48+-0 #M+ ( 8 , 4 ) 

S=73(67,90) Mp-1056.4 Tp=45 P=AYDPYLIAM 

genpept PR=>gi | 12804 913 I gb I AAH0190 9 . 1 I AAH01909 (BC001909) Unknown 
(protein for IMAGE : 35374 47 ) [Homo sapiens] POS=80 (SEQ ID NO:346) 

40 G=1461: HLA=Cw4 Mg-1058.3 Tg=38+-4 #M+(6,3) #N+(16,3) 

S=73(80,57) Mp=1058.4 Tp=39 P=YFDPANGKF 

genpept PR=>gi | 31108 I emb I CAA77750 . 1 ! (Z11692) human elongation 
factor 2 [Homo sapiens] POS=265 (SEQ ID NO: 347) 
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G=1465: HLA=Cw4 Mg=1063.4 Tg=21+-0 #M+(4,3) 

3=79(82,72) Mp=1063.6 Tp=39 P=LFDHAVSKF 

qenpept PR=>gi | 2 9600 69 I emb | CAA73314 . 1 | (Y12777) acyl-CoA 

5 synthetase-like protein [Homo sapiens] POS=41 (SEQ ID NO: 348) 

G=1475: HLA=Cw4 Mg=1071.4 Tg=27+-1 #M+(6,3) 

S=84<93,63) Mp=1071.6 Tp=37 P=VYDGKIYTL 

genpept PR->gi | 15559254 | gb | AAH13982 . 1 1 AAH13982 (BC013982) Unknown 
10 (protein for IMAGE : 313904 3 ) [Homo sapiens] POS=512 (SEQ ID NO: 349) 

G=14 78 : HLA=Cw4 Mg-1072 . 3 Tg=53+-1 #M+ (6, 4) #N+ (2, 2) 
S=83<88,74) Mp=1072.6 Tp=48 P=HFDLSVIEL 

genpept PR=>gi | 418 5277 | gb| AAD08 998 . 1 j (AF081535) CDC4 5L [Homo 
15 sapiens] POS=541 (SEQ ID NO: 350) 

G=1489: HLA=Cw4 Mg=1080.4 Tg=45+-3 #M+(9,4) #N+(8,3) 

S=82<91,63) Mp=1080.5 Tp=43 P=TFDDIVHSF 

genpept PR=>gi | 104 9053 | gb I AAC50259 . 1 | (U26644) encodes region of 
20 fatty acid synthase activity; FAS; multifunctional protein [Homo 
sapiens] POS=544 (SEQ ID NO:351) 

G=1494 : HLA-Cw4 Mg=1083. 3 Tg=32+-1 #M+(7, 3) 

5-79(85,66) Mp=1083.6 Tp=36 P=HFDPEVVQI 

25 genpept PR=>gi | 7582302 | gb | AAF64 271 . 1 | AF208857_1 (AF208857) BM-015 
[Homo sapiens] POS=144 (SEQ ID NO: 352) 

G=1498: HLA=Cw4 Mg=1091.3 Tg=54+-1 #M+(5,4) #N+(3,2) 
S=83(92 r 63) Mp=1091.5 Tp-54 P=SYDLFVNSF 

30 genpept PR->gi | 13185303 | emb | CAC33320 . 1 | (AX083457) unnamed protein 
product [Homo sapiens] POS=19 (SEQ ID NO: 353) 

G=1504: HLA=Cw4 Mg=1096.4 Tg=47+-1 #M+(15,4) #N+(3,2) 
S=70(78,54) Mp=1096.6 Tp=49 P=TFDLQRIGF 

35 genpept PR=>gi | 1353700 I gb I AAB1 8 37 5 . 1 I (U42360) N33 protein form 1 
[Homo sapiens] POS=165 (SEQ ID NO: 354) 

G=1505: HLA=Cw4 Mg=1098.3 Tg=27+-7 #M+(2 / l) #N+(1,1) 

S=73(77,66) Mp=1098.5 Tp=31 P=KYDPNVYSI 

40 genpept PR=>gi | 34 0307 | gb | AAA36808 . 1 | (M14648) vitronectin alpha 
subunit precursor [Homo sapiens] POS=224 (SEQ ID NO: 355) 

G=1523: HLA=Cw4 Mg=1109.4 Tg=64+-1 #N+(16,3) 
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S=78 (91,49) Mp=1109.5 Tp=67 P-FWPSLPSYL 

genpept PR=>gi | 12803527 I gb ! AAH02591 . 1 | AAH02591 (BC002591) matrix 
metalloproteinase 10 ( stromelysin 2) [Homo sapiens] POS=331 (SEQ ID 
NO: 356) 

5 

G=1527: HLA=Cw4 Mg=1116.4 Tg=18+-0 #M+(6,3) 

S=79(89,57) Mp=1116.6 Tp=28 P=YYDGKVMKL 

genpept PR->gi | 339804 I gb I AAA61206 . 1 I (M60706) topoisomerase I [Homo 
sapiens] POS=241 (SEQ ID NO: 357) 

10 

G=1532: HLA=Cw4 Mg=1123.3 Tg=53+-1 #N+(4,2) 
3=69(81,41) Mp-1123.5 Tp=52 

genpept PR=>gi | 38822 4 9 I dbj I BAA34 4 84.il (AB018 307) 
[Homo sapiens] POS=21 (SEQ ID NO: 358) 

15 

G=1533: HLA=Cw4 Mg=1124.4 Tg=33+-0 #M+(6,3) 
S=79(82,74) Mp=1124.5 Tp=37 P=HWDPQEVTL 

genpept PR->gi I 13543938 | gb | AAH06111 . 1 | AAH06111 (BC006111) Similar to 
RIKEN cDNA 4930555L11 gene [Homo sapiens] POS=43 (SEQ ID NO: 359) 

20 

G=1541: HLA=Cw4 Mg=1132.4 Tg=44+-0 #M+(2,1) 

S=68(78,45) Mp=1132.6 Tp=44 P=VFDEAIRAVL 

genpept PR=>qi | 8574 039 1 emb I CAA10733 . 6 I (AJ132695) Raclb protein [Homo 
sapiens] POS=187 (SEQ ID NO:360) 

25 

G=1542: HLA=Cw4 Mg=1133.4 Tg=43+-2 #M+(4,3) #N+(2,2) 
S=67(65,74) Mp=1133.7 Tp=45 P=IYDSVKVYF 

genpept PR=>gi I 1507 964 8 I gb I AAH1164 1 . 1 | AAH11 64 1 (BC011641) Similar to 
solute carrier family 25 (mitochondrial carrier; phosphate carrier) , 
30 member 3 [Homo sapiens] POS=331 (SEQ ID NO: 361) 

G=1547: HLA=Cw4 Mg=1139.4 Tg=40+-5 #M+(5,3) #N+(5,3) 
S=73(80,57) Mp=1139.7 Tp=44 P=FYDERIVVV 

genpept PR->gi | 14 04 3353 | gb I AAH07 67 1 . 1 | AAH07 67 1 (BC007671) Similar to 
35 CG7020 gene product [Homo sapiens] POS=217 (SEQ ID NO: 362) 

G=1551: HLA=Cw4 Mg=1141,6 Tg-43+-l #M+(6,3) 

S=71(74,66) Mp=1141.6 Tp=46 P=KWPDRITLL 

genpept PR=>gi | 1 94 52 7 1 | emb I CAA6354 9 . 1 I (X92972) protein phosphatase 6 
40 [Homo sapiens] POS=102 (SEQ ID NO: 363) 

G=1561: HLA=Cw4 Mg=1156.5 Tg=16+-0 #M+(4,3) 



P=SFDLLPREF 
KIAA0764 protein 
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5=82(90,66) Mp=1156.7 Tp=28 P=YYDEKVVKL 

genpept PR=>gi | 14 318 634 I gb I AAH0 9115 . 1 | AAH0 9115 (BC00 9115) Unknown 
(protein for IMAGE : 4 154008 ) [Homo sapiens] POS-387 (SEQ ID NO:364) 



10 



15 



20 



25 



30 



G=1562: HLA=Cw4 Mg=1163.5 Tg=50+-2 #M+(7,3) #N+(5,3) 

S=68(78,45) Mp=1163.7 Tp=27 P=LFDKHKTKF 

genpept PR=>gi | 15099953 I gb I AAK84 17 6 . 1 I AF384 161_1 (AF38 4161) 

diacylglycerol acyltransf erase 2 [Homo sapiens] POS=368 (SEQ ID 

NO:365) 

G=1567: HLA=Cw4 Mg=1181.4 Tg=37+-4 #M+(6,3) #N+(7,2) 

S=62(71,41) Mp=1181.7 Tp=38 P=YYDPKHVIF 

genpept PR->gi I 70237 14 I dbj I BAA92063 . 1 I (AK002060) unnamed protein 
product [Homo sapiens] POS=500 (SEQ ID NO: 366) 

G=1570: HLA=Cw4 Mg=1186.3 Tg=34+-0 #M+(6,3) 

S=86(98,60) Mp=1186.3 Tp=41 P=SYDPTIENTF 

genpept PR=>gi I 453470 | emb | CAA82774 . 1 | (Z29677) Ras-related 

GTP-binding protein [Homo sapiens] POS=34 (SEQ ID NO: 367) 



G=1574: HLA=Cw4 Mg=1192.3 Tg=71+-2 #N+(6,3) 
S=67(76,49) Mp=1192.6 Tp=66 

genpept PR=>gi | 702374 9 | dbj I BAA92075.il (AK002082) 
product [Homo sapiens] POS=373 (SEQ ID NO: 368) 

G=1577: HLA=Cw4 Mg=1207 . 3 Tg=43+-0 #M+(5,3) 
S=77(92,45) Mp=1207.5 Tp=42 

genpept PR=>gi | 7020377 | dbj I BAA91103. 1 | (AK000350) 
product [Homo sapiens] P0S=187 (SEQ ID NO: 369) 

G=1588: HLA=Cw4 Mg=1255.1 Tg=61+-2 #N+(6,3) 
S=61(65,53) Mp=1255.6 Tp=60 

genpept PR=>gi | 7 95 98 93 | gb | AAF71117 . 1 |AF116721_97 
[Homo sapiens] POS=61 (SEQ ID NO:370) 



P=YLPDFLDYF 
unnamed protein 



P=RFDEAYIYM 
unnamed protein 



P=YFDPQYFEF 
(AF116697) PR02242 



35 



DISCUSSION 

Among the thousands of different peptides presented within the context 
of the MHC class-I on cancer cells, only a few may eventually become 
40 candidates for the development of anti-cancer vaccines. The identification of 
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such cancer specific peptides depends on sequencing a relatively large number 
of peptides. 

While reducing the present invention to practice, a novel method was 
developed to identify candidate peptides for the development of anti-cancer 
vaccines. The novel method involves expressing the soluble extra-cellular 
domain of the MHC molecules that are simple to purify and the recovery, from 
them, large amounts of MHC bound peptides ready for identification by 
ESI-MS/MS. 

Purification of the extra-cellular domain of MHC was previously 
achieved by truncating its entire transmembrane and cytoplasmic domains 
[30], by using a non- functional transmembrane domain such as Q10^ [24] or 
fusing the extra-cellular domains to soluble secreted proteins such as 
antibodies Fc domains [31, 32]. Such sMHC molecules were produced in 
cultured cells of murine [33], human [30, 34] or insect [35] and in bacteria 
[36]. The soluble MHC molecules expressed by the murine or the human cells 
were capable of binding to their cognate TCRs, indicating the presence of 
bound authentic peptides that mediate this interaction [33, 37]. Bound 
peptides recovered from the secreted murine MHC H-2Ld were analyzed by 
Edman sequencing [38]. More recently, peptides recovered from the murine 
Q2/Q10b, which is a natural mutation resulting in the formation of soluble and 
secreted MHC molecules, were analyzed by ESI-MS/MS [39]. The results, 
however, were very disappointing as only six peptides were recovered from 50 
liters of culture medium [39]. 

While culture cancer cell lines are invaluable model for cancer research, 
only a limited number of good model lines are available for the study of tumor 
immunology since some of the better model cell lines have rare MHC 
haplotypes or down regulated MHC expression altogether. The introduction of 
foreign MHC into such cells in accordance with the teachings of the present 
invention facilitates the use of the desired model cell lines for the search for 
cancer specific MHC bound peptides. The recovery of secreted MHC from the 
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growth medium helps to sidestep possible interference by the cell's 
background MHC haplotypes. 

The number of peptides identifiable during each ESI-MS/MS run 
performed in accordance with the present invention was limited by the rate the 
mass spectrometers can switch between measuring the full spectrum to 
performing CID, which was about four seconds. Therefore, during a 
chromatography of ninety minutes, around a thousand different peptides could 
be mass measured and fragmented. The elution order of most of the peptides 
recovered for MHC of a particular type and resolved in different 
chromatography runs was similar. Therefore, their masses and CID data were 
combined in order to improve their signal-to-noise ratio. 

About one thousand different molecules that are certainly peptides have 
been fragmented at least twice in all the different chromatographs and out of 
these about two hundreds different peptides have been identified at high 
certainty. Most of these peptides were derived from housekeeping proteins 
and only a few were derived from proteins related to cancer. To increase the 
likelihood of identifying more new cancer specific peptides, the total number 
of identified peptides should be further enlarged. Identification of large 
number of peptides is currently limited by both the availability of sufficient 
amounts of peptides, by the capabilities of the mass spectrometers and by the 
non-completeness of the databanks. With the expected near availability of the 
entire human genome sequence, it is expected that more of the peptides will be 
identifiable, excluding mutant peptides that will still need to be sequenced de 
novo. 

The soluble and secreted MHC molecules described here present 
similar patterns of peptides as do the original cell surface MHC. This 
conclusion emanates from the observation that most of these peptides, posses 
an amino acid sequence that fit the known sequence consensus of HLA-A2.1 
and of B7 (see score columns in Table 8 above). Some of the peptides have 
been identified previously as MHC bound peptides and thus indicate the 
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validity of the methodology of the present invention. The most significant 
advantage of the use of secreted MHC as a source for peptides for analysis has 
to do with the order of magnitude larger recovery of sMHC molecules and 
therefore peptides per cell over the alternative purification from detergent 
5 solubilized cells and the purified sMHC molecules were free of interfering 
cellular debris and detergents. 

Direct biochemical analysis of peptides eluted from MHC molecules 
that are recovered from cancer cells, allows unbiased identification of those 
peptides that are actually presented by the MHC. Even though, identifying 

10 putative MHC bound peptides using computer programs based on the 
consensus motifs followed by synthesizing them and testing their 
immunogenicity, bypasses the reliance on expensive and technically 
demanding mass-spectrometry needed for biochemical analysis of MHC bound 
peptides. However, the motif prediction approach is dependent on the 

15 availability of well-established consensus for the MHC allele of interest and is 
hampered by the difficulty of taking into account the processing machinery 
involved in generating the peptides and transporting them to the MHC [13]. 
Moreover, it was suggested that contaminating protecting groups inadvertently 
left on the synthetic peptides are very immunogenic and may become the target 

20 for the activity of the CTLs. The CTLs generated in vitro are often low 
affinity binders and incapable of recognizing the rare peptides actually 
presented by the cancer cells in vivo [10]. 

The examples of identified peptides listed in Table 8 above include 
peptides that do not fit the accepted consensus of MHC bound peptides 

25 presented by the studied MHC haplotypes. Peptides longer than ten amino 
acids are not expected to be common among MHC class-I peptides [40, 41]. 
However, in this study, few peptides of 1 1 amino acids (pi 2 10, SEQ ID NO: 3 
and pl258, SEQ ID NO:24) and 12 amino acids (pi 360, SEQ ID NO:25) long 
were observed among the identified peptides. The computer programs for 

30 motif predictions of class-I peptides are not able to predict such peptides as 
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their length is outside the consensus [42, 43]. The detection of longer peptides 
among the peptides in the natural mixture suggests that the consensus motif 
should possibly be extended to include such outliers. Another interesting 
observation is the relatively abundance of peptides that originated from 
overlapping parts of the proteins with one or two amino acids difference in 
length such as p800 (SEQ ID NO:4) and p913 (SEQ ID NO:5) from p-catenin, 
pi 145 (SEQ ID NO:23) and pl258 (SEQ ID NO:24) from fatty acid synthase, 
p898 (SEQ ID NO:l), plOll (SEQ ID NO:2) and pl210 (SEQ ID NO:3) from 
IP30 (Table 8). Moreover, peptides pl080 (SEQ ID NO:19) and pl094 (SEQ 
ID NO:21) are derived from homologous site in two alleles of the same 
protein. This observation points to the existence of structural hotspots for 
generation of peptides, possibly as a result of heath-shock proteins binding and 
protection from complete proteolysis of these regions. Differences in length 
could also result from incomplete trimming of the peptides in the endoplasmic 
reticulum [44, 45]. 

Interesting observations are the large similarities between the patterns 
of peptides produced by cell lines of different tissue origin and on the other 
hand, the presence of a few peptides that are unique to one type of cancer cells. 
The ability to characterize the similarities and differences between peptide 
patterns of different cell lines and growth conditions and between different 
HLA haplotypes are among the most important possible uses of the novel 
methodology presented herein. 

The most effective mean to ascertain the identity of the amino acid 
sequences of peptides that were identified by this method is to compare their 
retention times, their exact masses and their CID data to those of the 
corresponding synthetic peptides [16, 39, 46, 47]. The sequences of all the 
peptides that were identified at high confidence by searching the databank with 
their mass spectrometry data were shown to be correct when these parameters 
were compared with the corresponding synthetic peptides. 



113 

A number of peptides identified here were derived from known tumor 
antigens. Those peptides that attracted the attention as possibly cancer specific 
were chemically synthesized and tested again. The fact that a few of them 
elicited a CTL response in mice may point to their possible immunogenicity in 
5 human. 

Tumor proteins from which identified peptides were derived included 
mucin (MUC1), a well-studied tumor-associated antigen that is up regulated in 
breast and ovarian carcinomas [48]. A number of HLA-A2.1 restricted 
MUC1 -derived CTL epitopes were identified by the motif prediction approach 
10 [26, 49-52]. Peptide p947 (NLTISDVSV, SEQ ID NO:8) identified here from 
breast carcinoma cells (MCF-7) is the same peptide that was predicted and 
confirmed to be a HLA-A2 antigen originating from MUC1 by Cannon et al. 
[26]. 

Another peptide derived from a known tumor antigen, was pi 091 (SEQ 
15 ID NO:20) from the testis-cancer antigen MAGE-B2. It belongs to a group of 
2 1 known genes that are essentially silent in most normal cells except for testis 
and placental trophobalsts and since different member of the MAGE proteins 
are expressed in a variety of tumors, they attracted significant attention as 
cancer vaccine candidates [53-57]. A few peptides were identified so far from 
20 the MAGE proteins by genetic approach and by predicting their sequence 
based on the known motifs rather than by the biochemical approach [27, 28, 
58-61] (reviewed in [10]). The identification of the novel MAGE-B2 derived 
peptide pi 091 (GVYDGEEHSV) (SEQ ID NO:20) by the direct biochemical 
approach is a very encouraging observation that confirmed the validity of this 
25 method for identification of novel tumor specific antigens. Homologous 
peptides from MAGE-A4 and MAGE-A10 proteins were previously identified 
as MHC bound peptides and tested for their immunogenicity (see Figure 5D). 
This suggests the existence of a possible hot spot within the MAGE protein for 
processing as MHC bound peptides [27, 28]. 
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Peptides derived from other proteins that are involved with cancer 
progression and may also serve as candidates for anti-cancer vaccines of 
diagnosis include p913 (SEQ ID NO: 5) from p-catenin, which is normally 
involved in cellular adhesion, signal transduction and as a transcription 
enhancer with a possible oncogenic role in colorectal cancer. Abnormal high 
amounts of the protein were found in the cytoplasm in cancer cells instead of 
the intracellular boundary in normal cells and this abnormal behavior was 
correlated with metastasis [62-64]. Peptide pi 145 (SEQ ID NO:23) and pl258 
(SEQ ID NO:24) is derived form fatty acid synthase (FAS), a biosynthetic 
enzyme expressed in liver and lactating breasts and is a marker of poor 
prognosis when expressed in colon, prostate, ovarian, breast and endometrial 
cancers. Its significance for cancer is was established by inhibiting it activity, 
which leads to apoptosis in cancer cells [65-69], The enzyme DNA methyl 
transferase (MTDM) is the source protein for pl028 (SEQ ID NO: 13) an 
enzyme that is highly expressed in different cancer cell types, including 
prostate and breast [70-72]. Increased MTDM activity is usually associated 
with tumor progression and is considered to be an important event in cell 
transformation [71, 73]. 

Once tumor specific MHC bound peptides are identified and their 
ability to stimulate an immune response is demonstrated, such peptides 
become candidates for adoptive immunotherapy. Identification of peptides 
originating from normal proteins that are uniquely expressed in non-vital 
organs, such as breast, prostate and ovaries can become very useful for 
immunotherapy of these cancers. The potential usefulness of identified 
immunogenic peptides should be evaluated by the presence of specific T cells 
directed against them in patients inflicted with the particular cancer using 
standard assays such as ELISPOT and CTL. The assay of immunizing mice 
with the peptides described herein was meant to serve first as validation that 
these peptides are indeed MHC bound peptides with affinity for the HLA-A2.1 
and as the preliminary indication of their immunogenic potential. 



115 

Secreted soluble MHC such as described herein can also be used for 
analysis of peptides presented by cells involved with pathologies other than 
cancer, such as autoimmune diseases and viral infections with the aims of 
identifying peptides of significance for treating these diseases. The method 

5 can also be used for identifications of MHC bound peptides presented on 
normal cells of specific tissues, peptides presented by particular MHC alleles 
and peptides originating from expression of particular proteins of interest. 
Moreover, the approach can be used for analysis of MHC bound peptides 
derived from over-expression of specific proteins, from induced mutations, as 

10 a result of metastasis progression and as a way for searching for peptides 
derived from signal peptides of cell surface proteins. The approach described 
in this study is also useful for comparisons between patterns of MHC bound 
peptides induced by minor changes in the cells growth conditions such as the 
addition of hormones, the expression of a foreign protein or under stress 

15 conditions. 

Therefore, an appealing outcome of the methodology described herein 
is that the simple expression of different recombinant MHC molecules in 
different cell lines in a soluble, secreted form and their easy recovery from the 
growth medium with their peptides still attached, followed by comprehensive 
20 analysis of the peptides may become a good staging point for above listed 
ambitious research projects. Such 'human MHC-peptide projects' are worthy 
goals to follow the human genome and proteome projects. 

It is appreciated that certain features of the invention, which are, for 
25 clarity, described in the context of separate embodiments, may also be 
provided in combination in a single embodiment. Conversely, various features 
of the invention, which are, for brevity, described in the context of a single 
embodiment, may also be provided separately or in any suitable 
subcombination. 
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Although the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alternatives, modifications and 
variations will be apparent to those skilled in the art. Accordingly, it is 
intended to embrace all such alternatives, modifications and variations that fall 
within the spirit and broad scope of the appended claims. All publications, 
patents and patent applications mentioned in this specification are herein 
incorporated in their entirety by reference into the specification, to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated herein by reference. 
In addition, citation or identification of any reference in this application shall 
not be construed as an admission that such reference is available as prior art to 
the present invention. 
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